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ATGAGCCAGC 
TACCCTCTTC 
GAGAGAAACT 
GTGGTGTTTG 
TAAAGACTAC 
TCCAGGTCCC 
GTCAATGGCC 
GGGGGGTGTT 
TGGTGGGTAA 
TTCAACAATG 
TGAGAGGCTG 
AGATGGGTCT 
GAGTCCTTCA 
TTCGGTGCTC 
CAGGGTCCAT 
ACCAACATCG 
GCTGGTGAAG 
TCCCTACAGA 
TGTAAGATGC 
CGGGTTCCCC 
TAGGCACAGC 
ATGGGCCGGA 
GGAGATGATC 
TTCATGTCAT 
GGAGGAAAAT 
CACCATGCTG 
TAGGAGCCCT 
GACCCCTACT 
AGTGAGCTTC 
TGGGTGTCGC 
AATGGCTATG 
CAAGACCTAT 
ACTGCTCCCC 
ATCGCCAAGA 
GCCCCCCACC 
TGTCCTATAT 
CCACCAGCCT 
CGTCCCACCC 
TCAGCTTCGT 
ACCTATGGGA 
GGTGTACAAT 
AATGCAAGCA 
GCAAATGCTA 
GGATGCTGAG 
GGGACTTAGA 
ACCGCCCTGT 



CCAGGCCCCG 
GACGATGAGT 
TCGCAATGCC 
GGCTGCTGCC 
ATCATTCCTG 
ACAAGGCATG 
TCTACTCCTC 
CACCAGATGG 
CATCTGTCTG 
CCACCAATGA 
CACGTGTCAG 
GGGCTTCATG 
TCCGGGGCTT 
AAGTACATCT 
CGTCTTTACC 
CCTCGCTCAT 
GAGCTCAATG 
GATGATTGTG 
CCAAAAAGTA 
ACCCCGGTGT 
CTTCTCCCTA 
CCCTGGCCAA 
GCTCTCGGCT 
TTGCTGTGCG 
CCCAGGTGGC 
GTCCTGGGGA 
GATCGCTGTC 
ACCTGTGGAG 
CTCTCCTCCT 
CTTCTCCGTC 
CACTGGCCCA 
AATAGGGCCC 
TCT.CTACTTT 
CTGTCTCCCT 
GACCCCAACA 
CACCTTCAGC 
CCGCTGAGGC 
TTCGTCACCT 
GGACTTGATG 
AGATCGGCGT 
GACATTAGCC 
CGTCTTTCCC 
GAGACGTGAC 
CTCTCCTTGT 
GCAGGAGATG 
GA 



CTACGTGGTA 
TTGAGAAGAA 
TTCAGATGTT 
TGTGCTCTCC 
ACCTGCTCGG 
GCATTTGCTC 
CTTCTTCCCC 
TGCCAGGTAC 
CAGCTGGCCC 
GAGCTATGTG 
CTACGCTAGC 
CAGTTTGGCT 
CATGACGGCC 
TCGGACTGAC 
TTCATTGACA 
CTTCGCTCTC 
CTCGCTACAT 
GTGGTGGTGG 
TCACATGCAG 
CGCCTGTGGT 
GCCATCGTGA 
CAAGCACGGC 
GCAGCAACTT 
CTTTCTGTCA 
CAGCCTGTGT 
TCTATCTGTA 
AATCTCAAGA 
GAAGAGCAAG 
TCTTCCTCAG 
CTGGTCGTGG 
GGTCATGGAC 
AGGATATCCA 
GCCAACTCAG 
GCAGGAGCTG 
ACAACCAGAC 
CCTGACAGCT 
CCCCGGCGAG 
TCCACACCCT 
GGCATCAAGG 
GAAGGTCTTC 
ATGGAGGCGT 
AGCATACATG 
CCCAGGACAC 
ACGACTCAGA 
TTCGGGAGCA 



GACAGAGCCG 
GGACCGGACA 
CCTCAGCCAA 
TGGCTCCCCA 
TGGACTCAGC 
TGCTGGCCAA 
CTCCTGACCT 
CTTTGCCGTT 
CAGAGTCGAA 
GACACAGCAG 
CTGCCTCACC 
TTGTGGCCAT 
GCCGGCCTGC 
CATCCCCTCC 
TTTGCAAAAA 
ATCAGCGGTG 
GCACAAGATT 
CAACAGCTAT 
ATCGTGGGAG 
CTCACAGTGG 
GCTACGTCAT 
TACGACGTGG 
CTTTGGCTCC 
CTCTGGCTGT 
GTGTCTCTGG 
TCCTCTCCCT 
ACTCCCTCAA 
CTGGACTGTT 
CCTGCCCTAT 
TCTTCCAGAC 
ACTGACATTT 
GGGGATTAAA 
AGATCTTCAG 
CAGCAGGACT 
CCCGGCTAAC 
CCTCACCTGC 
CCCAGTGACA 
CATCCTGGAC 
CCCTGGCCAA 
TTGGTGAACA 
CTTTGAGGAT 
ACGCAGTCCT 
AACTTCCAAG 
GGAGGACATT 
TGTTTCACGC 



CATACTCCCT 
TACCCAGTGG 
GATCAAAGCT 
AGTACAAGAT 
GGGGGATCCA 
CCTTCCTGCA 
ACTTCTTCCT 
ATCAGCATCC 
ATTCCAGGTC 
CCATGGAGGC 
GCCATCATCC 
CTACCTCTCC 
AGATCCTGAT 
TACACAGGCC 
CCTCCCCCAC 
CCTTCCTGGT 
CGCTTCCCCA 
CTCCGGGGGC 
AAATCCAACG 
AAGGACATGA 
CAACCTGGCT 
ATTCGAACCA 
TTCTTTAAAA 
GGATGGAGCT 
TGGTGATGAT 
AAGTCTGTGC 
GCAACTCACC 
GCATCTGGGT 
GGTGTGGCAG 
TCAGTTTCGA 
ATGTGAATCC 
ATCATCACGT 
GCAAAAGGTC 
TTGAGAATGC 
GGCACCAGCG 
CCAGAGTGAG 
TGCTGGCCAG 
ATGAGTGGAG 
GCTGAGCTCC 
TCCATGCCCA 
GGGAGTCTAG 
CTTTGCCCAG 
GGGCTCCAGG 
CGCAGCTACT 
AGAGACCCTG 



FEATURES : 

Start Codon: 
Stop Codon: 



1 

2260 
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HOMOLOGOUS PROTEINS: 

Top BLAST Hits: 



gb|AAF81911.1 
gb|AAF71715.1 
ref |NP_000432 
ref |NP_035997 
ref |NP_062087 
ref |NP_067328 
ref |NP_000102 
sp | 070531 | DTD 
ref INP_000103" 
reflNP 031911 



1AF279265_1 (AF279265) putative anion transpor. 
|AF230376_1 (AF230376) prestin [Meriones ungui. 
.11 pendrin [Homo sapiens) >gi 1 11421915 I ref I XP. 
.11 Pendred syndrome homolog (human); Pendred'. 
.11 Pendred syndrome homolog (human) [Rattus n. 
.11 down- regulated in adenoma [Mus musculus] >. 
.11 down- regulated in adenoma protein [Homo sa. 
_RAT SULFATE TRANSPORTER (DIASTROPHIC DYSPLASI. 
.11 sulfate anion transporter 1; Diastrophic d. 
.11 diastrophic dysplasia [Mus musculus] >gi|2. 



Score 


E 


476 


e-133 


471 


e-131 


451 


e-125 


448 


e-124 


447 


e-124 


434 


e-120 


418 


e-115 


365 


le-99 


362 


le-98 


357 


4e-97 



BLAST to dbEST: 



git 8630793 /dataset=dbest /taxon=960. 



Score E 
523 e-146 



EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

Expression information from BLAST dbEST hits: 
gi | 8 6307 93 Human head-neck 



Expression information from PCR-based tissue screening panels: 
Human fetal lung 
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1 MSQPRPRYVV DRAAYSLTLF DDEFEKKDRT YPVGEKLRNA FRCSSAKIKA 

51 WFGLLPVLS WLPKYKIKDY IIPDLLGGLS GGSIQVPQGM AFALLANLPA 

101 VNGLYSSFFP LLTYFFLGGV HQMVPGTFAV ISILVGNICL QLAPESKFQV 

151 FNNATNESYV DTAAMEAERL HVSATLACLT AIIQMGLGFM QFGFVAIYLS 

201 ESFIRGFMTA AGLQILISVL KYIFGLTIPS YTGPGSIVFT FIDICKNLPH 

251 TNIASLIFAL ISGAFLVLVK ELNARYMHKI RFPIPTEMIV WVATAISGG 

301 CKMPKKYHMQ IVGEIQRGFP TPVSPVVSQW KDMIGTAFSL AIVSYVINLA 

351 MGRTLANKHG YDVDSNQEMI ALGCSNFFGS FFKIHVICCA LSVTLAVDGA 

401 GGKSQVASLC VSLWMITML VLGIYLYPLP KSVLGALIAV NLKNSLKQLT 

451 DPYYLWRKSK LDCCIWWSF LSSFFLSLPY GVAVGVAFSV LVWFQTQFR 

501 NGYALAQVMD TDIYVNPKTY NRAQDIQGIK IITYCSPLYF ANSEIFRQKV 

551 IAKTVSLQEL QQDFENAPPT DPNNNQTPAN GTSVSYITFS PDSSSPAQSE 

601 PPASAEAPGE PSDMLASVPP FVTFHTLILD MSGVSFVDLM GIKALAKLSS 

651 TYGKIGVKVF LVNIHAQVYN DISHGGVFED GSLECKHVFP SIHDAVLFAQ 

701 ANARDVTPGH NFQGAPGDAE LSLYDSEEDI RSYWDLEQEM FGSMFHAETL 

751 TAL 



FEATURES : 

Functional domains and key regions : 

[1] PDOC00001 PS00001 ASN__GLYCOS YLAT ION 
N-glycosylation site 



Number of matches: 3 

1 153-156 NATN 

2 156-159 NESY 

3 580-583 NGTS 



[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 



Number of matches: 2 

1 45-47 SAK 

2 445-447 SLK 



[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 



Number of 


matches : 


11 


1 


18-21 


TLFD 


2 


158-161 


SYVD 


3 


240-243 


TFID 


4 


365-368 


SNQE 


5 


459-462 


SKLD 


6 


556-559 


SLQE 


7 


635-638 


SFVD 


8 


691-694 


SIHD 


9 


722-725 


SLYD 


10 


726-729 


SEED 


11 


732-735 


SYWD 



[4] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 

Number of matches: 2 

1 7-15 RYWDRAAY 

2 447-454 KQLTDPYY 
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[5] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 



Number of 


matches : 


10 


1 


77-82 


GGLSGG 


2 


78-83 


GLSGGS 


3 


89-94 


GMAFAL 


4 


103-108 


GLYSSF 


5 


335-340 


GTAFSL 


6 


435-440 


GALIAV 


7 


481-486 


GVAVGV 


8 


485-490 


GVAFSV 


9 


581-586 


GTSVSY 


10 


681-686 


GSLECK 



Membrane spanning structure and domains: 





Helix 


Begin 


End 


Score 


Certainty 




1 


51 


71 


0. 


.893 


Putative 




2 


82 


102 


1. 


.020 


Certain 




3 


107 


127 


1. 


.729 


Certain 




4 


130 


150 


1. 


.497 


Certain 




5 


186 


206 


1. 


.723 


Certain 




6 


228 


248 


1. 


.517 


Certain 




7 


256 


276 


1. 


.898 


Certain 




8 


288 


308 


1. 


.252 


Certain 




9 


338 


358 


1. 


.568 


Certain 




10 


383 


403 


1. 


.304 


Certain 




11 


412 


432 


2. 


.345 


Certain 




12 


469 


489 


1. 


.997 


Certain 




13 


619 


639 


1. 


.146 


Certain 
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BLAST Alignment to Top Hit: 

>gb| AAF81911. 1 |AF279265_1 (AF279265) putative anion transporter 1 [Homo 
sapiens] 

Length = 738 
Score = 476 bits (1224), Expect = e-133 

Identities = 263/724 (36%), Positives = 428/724 (58%), Gaps = 36/724 (4%) 
Frame = +3 

Query: 54 LFDDEFEEKDR — TYPVGEKLRNAFRCSSAKIKAWFGLLPVLSWLPKYKIKDYIIPDLL 227 

L + EER+P +R +CS A+ A++ LPVL WLP+Y ++D+++ DLL 
Sbjct: 15 LNQEHLEELGRWGSAPRTHQWRTWLQCSRARAYALLLQHLPVLVWLPRYPVRDWLLGDLL 74 

Query: 228 GGLSGGSIQVPQGMAFALLANLPAVNGLYSSFFPLLTYFFLGGVHQMVPGTFAVISILVG 407 

GLS +Q+PQG+A+ALLA LP V GLYSSF+P+ YF G + GTFAV+S++VG 
Sbjct: 75 SGLSVAIMQLPQGLAYALLAGLPPVFGLYSSFYPVFIYFLFGTSRHISVGTFAVMSVMVG 134 

Query: 408 NICLQLAPESKFQVFNNATNESYVDTAAMEAERLHVSATLACLTAIIQMGLGFMQFGFVA 587 

++ LAP+ A N+S ++ A +A R+ V++TL+ L + Q+GLG + FGFV 

Sbjct: 135 SVTESLAPQ ALNDSMINETARDAARVQVASTLSVLVGLFQVGLGLIHFGFW 186 

Query: 588 IYLSESFIRGFMTAAGLQILISVLKYIFGLTIPSYTGPGSIVFTFIDICKNLPHTNIASL 7 67 

YLSE +RG+ TAA +Q+ +S LKY+FGL + S++GP S+++T +++C LP + + ++ 
Sbjct: 187 TYLSEPLVRGYTTAAAVQVFVSQLKYVFGLHLSSHSGPLSLIYTVLEVCWKLPQSKVGTV 246 

Query: 768 I FALI SGAFLVLVKELNARYMHKIRFPI PTEMI WWATAISGGCKMPKKYHMQIVGEIQ 947 

+ A ++G LV+VK LN + ++ PIP E++ ++ AT IS G + ++ + +VG I 
Sbjct: 247 VTAAVAGWLVWKLLNDKLQQQLPMPIPGELLTLIGATGISYGMGLKHRFEVDWGNIP 306 

Query: 948 RGFPTPVSPWLQWKDMIGTAFSLAIVSYVINLAMGRTLANKHGYDVDSNQEMIALGCSN 1127 

G PV+P + ++G+AF++A+V + I +++G+ A +HGY VDSNQE++ALG SN 
Sbjct: 307 AGLVPPVAPNTQLFSKLVGSAFTIAWGFAIAISLGKIFALRHGYRVDSNQELVALGLSN 366 

Query: 1128 FFGSFFKIHVICCALSVTLAVDGAGGKSQVASLCVSLWMITMLVLGIYLYPLPKSVLGA 1307 

G F+ + C++S +L + GG SQVA SL +++ ++ LG + LPK+VL A 
Sbjct: 367 LIGGIFQCFPVSCSMSRSLVQESTGGNSQVAGAISSLFILLIIVKLGELFHDLPKAVLAA 426 

Query: 1308 LIAVNLKNSLKQLTDPYYLWRKSKLDCCIWWSFLSSFFLSLPYGVAVGVAFSVLWVFQ 1487 

+1 VNLK L+QL+D LW+ ++ D IW+V+F ++ L+L G+ V V FS+L+W + 
Sbjct: 427 IIIVNLKGMLRQLSDMRSLWKANRADLLIWLVTFTATILLNLDLGLWAVIFSLLLVWR 486 

Query: 1488 TQFRNGYALAQVMDTDIYVNPKTYNRAQDIQGIKIITYCSPLYFANSEIF 1637 

TQ + L QV DTDIY + Y+ A++++G+K+ + +YFAN+E + 
Sbjct: 487 TQMPHYSVLGQVPDTDIYRDVAEYSEAKEVRGVKVFRSSATVYFANAEFYSDALKQRCGV 546 

Query: 1638 RQKVIAK — TVSLQELQQDFE-NAPPTDPNNNQTPAN-GTSVSYI 1760 

++K++ K + L++LQ++ + P N TS+ + 

Sbjct: 547 DVDFLISQKKKLLKKQEQLKLKQLQKEEKLRKQAASPKGASVSINVNTSLEDMRSNNVED 606 

Query: 1761 TFSPDSSSPAQSEPPASAEAPGEPSDMLASVPPFVTFHTLILDMSGVSFVDLMGI 1925 

S D A + ++AP + S + A P FH+LILD+ +SFVD + + 

Sbjct: 607 CKMMQVSSGDKMEDATANGQEDSKAP-DGSTLKALGLPQPDFHSLILDLGALSFVDTVCL 665 

Query: 1926 KALAKLSSTYGKIGVKVFLVNIHAQVYNDISHGGVFEDGSLECKHVFPSIHDAVLFAQAN 2105 

K+L + + +1 V+V++ H+ V + + G F D S+ KH+F S+HDAV FA + 
Sbjct: 666 KSLKN I FHDFRE IEVEVYMAACHS PVVSQLEAGHFF- DAS ITKKHLFASVHDAVT FALQH 724 

Query: 2106 ARDV 2117 
R V 

Sbjct: 725 PRPV 728 
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Hmmer search results (Pfam) : 

Model Description Score E-value N 

PF00916 Sulfate transporter family 254.5 1.5e-72 1 

PF00189 Ribosomal protein S3, C-terminal domain. 3.3 8 1 



Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 

PF00916 1/1 187 497 .. 1 328 [] 254.5 1.5e-72 

PF00189 1/1 651 661 79 89 . ] 3.3 8 
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1 CTGGGTTCCT ATGTGGGGAG GTCATGCTCC 
51 GGCAAACCAC CTGGACAGCC AGACCCATGC 
101 AGGAAGAGTG AGACCACCCC GCCTCACGGG 
151 CTGAATAGTC TCTGCTAGGA GGTAGAAAGC 
201 GTAATCATCG CCACTACCAT TTACTGGGTG 
251 TCATACACAT GATCTCACTG AATCCTCATA 
301 TTATCCCCAT TTACAGATGA AGAAACTGAA 
351 GGCTCTCAAA CTTGTGCTGT TTTCCCTAAG 
401 CCCACTGAAA TGTCTCACAT GCCATTGCCC 
451 CTCCTCCAAA ACACCATTTA TCAATTCGCT 
501 CACACTAAGG GCCAGGCGAG GGGCTGGGCA 
551 ATTCTCCCAC CTTCGCTTCT GCTGGGTATC 
601 CATCCCACCC TCACCTGTAG TTCAAGTGGA 
651 ATGAATGGAA TGCACCAGCC CAGCCTTCAC 
701 TCATAATAGA AACTCACATT TGCATCACAC 
751 TTCTTATCCA TTAACTCATT TGATCTTCAC 
801 CTGTTACTCC CACTTTAGTG ATACAGAATC 
851 TGACCATTCT GCCTCATTAA TAAAAGCAGG 
901 CCTTCCACAA AAGGCATTAA GCAACCTGCT 
951 GTCACCCAGG CTCTCCTCTG GGAAGTTGGG 
1001 AGTTACTCAT TTTCAACCCC ATCTCAAATC 
1051 CCACCCCACA CTCCCCACCT CCCAGATACA 
1101 CCCATCTCTT TCTTCTCTGT CCTTCTTTCT 
1151 TCTCCCAGCT CTGGGAGGTA GAGGGGAGGT 
1201 GAGGGAGGGG GAAAGGTTCC TACAGGGAAC 
1251 CTGGCACTCA GCTCTGCCCA TCTCAGCTCC 
1301 GCAAAAAGTG AGGAGGAGAG GAGCGGCAGT 
1351 TTAGGCACAG GAAGCCGTGG GAGAGAGAGC 
1401 TTTCCCCACA CACACCATTG TCCCCCTGGG 
1451 AGATGTCTTA TCCAAGAAGG GTCCTCTTGA 
1501 TGCCTCTAGG CAAGCTGTTT TCTGTTTCTT 
1551 GTAGGAGCCT TTCTGCCAGG GAAACTAAGG 
1601 TGTGGGGACA CCAGGGGTCA GGGGAGGGGA 
1651 GTGGGGCCTC CTGCCCTCGT GATTCCCCTT 
1701 TGATGGTGAC GCCACAGGTG TGGAGTGCCA 
1751 CAAAACAGCC AGGGTGAGTC TATGCATCAT 
1801 CACTGCGAGC AGGGAGTCTG ACGGAAAAAC 
1851 ACCTTGCTTT ATCGGGGCGG GGAAGGCCAG 
1901 AGGACCAGAG AGAGAAGGCC TGGGCTGGCA 
1951 ACCCTCCCCT CCTCTGCTTC TCCCAAAGCT 
2001 GCCAGCCCAG GCCCCGCTAC GTGGTAGACA 
2051 CTCTTCGACG ATGAGTTTGA GAAGAAGGAC 
2101 GAAACTTCGC AATGCCTTCA GGTAACTGGT 
2151 TCCTCTGCTC CCTACCAAAA TCCTTTCTGC 
2201 CTGGTATCCC TAAGATGGGG TTAAGGGAAG 
2251 GAATGATGAA TTTAAGATCC TACAACCTCA 
2301 GAGGATGGGG AGCAGGAGCA AGAACCATCC 
2351 CCAAACCCCT GCATGGCATC TCCCATATTC 
2401 TGGGTTTGTT AAGGCATGGT AGATGAGCAT 
2451 GAGCATCAGA GCCCTTACTC CATGCCCTGT 
2501 CTGAAGTTAC CATCACCCCA GGTTCTTTGT 
2551 CTTCCTCCAC TTGGTCCAGA GAATGCCAAA 
2601 ACTTTCCTGA GGGGACCTAC CAGGGTGTAG 
2651 TTTCCACTCT CATCTCCCTG GAAATGCGAT 
2701 CCTAAGTAGG CGCTACACAA GGTGATCCAT 
2751 GGGCTGCGGG TGTCACCCTC CCCATTCCCA 
2801 TGGCCCAGCT ATAGGCCAAC TCACTCTCCC 
2851 CTGATTCAGT TACCTGGATT GAGGTCACTG 
2901 GCAGGTGGAA CTGGTTCAGG CCGGGGGAAT 
2951 AAAAGCCCCA GCCCAGCCCT GTTTCTCTTG 
3001 GTTACAGTCT GTCCTCACAG CTGTGCTCCT 
3051 GTCTTTGTGC CCAAGACTTT AGGGCACAAA 
3101 GCTATTGTCC TAAAAGATTA GGATAATGAA 



CCACTCATTG AGCCCCCCCA 
AGACTCTGGA GCAGGTGGAG 
CGGTGAAGGG CCGGCAGCCT 
ACCCTCCCAT CTTAATCATA 
CCTATAAAAG GCCAGCCTCT 
GCATCTGCCT GCGACTGTTA 
TCTTTGAACC CAGGTCATCT 
CCACCCGGTC TCTCATTTCT 
TTACTCATTT CTGCCCATGT 
CAACAAGTAT GTGTTGAGTA 
CAGGCGCTGG GGGTAGGTTC 
ACCTGTGGGG TCTTGCCGGG 
CCTTGGGATC CCAAGACCAA 
CAACTTGAGC ACAATCTTAT 
TTTACATTTT ACACAACCCC 
AACAACCCTG TGAGATATGT 
TGAGGTTTGA AAAGTAATGC 
ATTAACCCAG GCTCCTGGAC 
CCCCTCTGAC AACCTCCCCT 
GGCATCTCTA GCCCCCAAGT 
TTTTGCCAAA CTGGCCACAG 
AATCCTCACT CTAAGCCTTC 
CTGTGGTCCT CTGAGCAACT 
GGGAGACCCA GTAATTGGAA 
TCCTCCGGGC CTCAGGGGCC 
TGGAACGTCA GCCAGGTTGC 
ACACAAGGGT GGGGGAAAGA 
CGGCAGGTGG ACCATCCTGG 
AAACCTGTTG GTGAAGTTCT 
GGTCATCTCA GCTATCCCCC 
CCAAGCTGAC TGGCTGAATG 
TCTGGGAAGG GAGTATGGCT 
GGGTCCACCT GCTGAATCAA 
TGCCTGGTGC TCAGTGGGGG 
GCCACGTGCT GAGCGCCAAG 
CAGTGCCTGG GAAGGAAGGC 
TTGACAGAGG GAAGGGAGGC 
AATAAAACTC TGCTACTGCA 
CTAGGGAGGG ATGTTCCCTC 
TGTAAATGCC CCAGATATGA 
GAGCCGCATA CTCCCTTACC 
CGGACATACC CAGTGGGAGA 
CCAGAGCCCA GACTTCTGCC 
ACCAGGACAC GGCTTCTGCA 
CCCTGGGGAA GTGAGGTTCT 
TCTGTACTGA GACCCCCAGG 
AGAAGGGTTA TATGGCATTC 
TCAATTCACC CGGGTCTCTC 
CTACGTTATG GAGGGGTGGG 
TCCCTCCTTA CAAAAAATAC 
CCTTTCCCTC CCGGATGTTC 
AGGAGGCCCT AAATTTCTGA 
TCCTACCAGC GCCCAGGGTC 
GGTGGGTATG AAACCTTGTC 
ACCCACACCC CAGGAGGCTG 
GACTCCTGGC AGACCTCCTC 
TCACTCCCTT GGGGAAACGG 
GCAATGGCTG AAGTGGAGAC 
CACCCACTTG AGTTTGTACT 
GGAGGCTCCA TTTCTGCCCA 
CAGACAGGTG GTCTCTGCCA 
GTCTGAGGAT GAGAAGATCT 
AGCTGTAAAG GGATATAGCA 
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3151 AACTAACAAT TCCTATGATA CTGGCATGAG AGCCTTGAAC AGTGCCTGGC 

3201 ATAGAGAAGG TGCACCAATA AATATTTGTT TCATGAATGA ATGAATGAAT 

3251 GAATGTCTAG AAAGCTAATC CCTCTCAGCC TCTGTTTCCA GTTCTTCTTT 

3301 CAAGCTTCAG ATTGCTTTGC CCAACATACA GCAGACTTGC AAGTAAGGTT 

3351 GGGCATGGAC TAGCCCTCAA ATGAGTTGTT TTTCTTTCCC TAGCCAGCTC 

3401 TCTATTCATA AGTCCGGCTT TCTCTGCCAC AAACAGACCT GATGGAGCCC 

3451 CTGCAGGGCT GGTTCTCTCT TCAAGCAAGG CTTTAGAGTT GCATTAAGCA 

3501 ATTTATCCCC CGTCCACCTC CCCTTCCAGC ATCCCAGGGA TGGCAGAGGC 

3551 ACCCATGAGC CCCAGAAGGG ACAGGGGGTA AGATATTGAT GATGATGCTT 

3601 TTTCTTGGAG TGTTAGTTGG AAGAGAAAAT CTGCCCAGAC TTTCCAAGGT 

3651 ACAAAGCATT GTCTTTGTTG GTTTCAGTCT TGGGTGACAT CCAGGGGACC 

3701 GAGTGTCAGG GAAACTATTG TTGAGCAAGA GCAAAGAGCA GGAATTGGTG 

3751 CTGGGCAGGA AAGGAAGCCT CAT C AG AG C A GGCCAGTGAG TCACCAAATG 

3801 GGCCCTAAGT ATTTGAGTTC CCTCAACTGG GAGAAGGAAA GCAAATGCCC 

3851 CTCACCCACT TCCAGTCATC AATCCACCGG CTGTCACCCT TGAGTTTGTA 

3901 AGCCCTTGTT CCTACCGCTC CTGAGTTTCT ATGAAAGGAC CTTGAGGTGT 

3951 TCAACAAACA GGGAAGGGAT CAACTCTCCC CACCCTGCGT TGACCAATGA 

4001 ATTCTTCCCT CCTCTGCTGC CCAGTGAATT AACAGGAGAA AGAACTCCGG 

4051 TATTGGAGTT ACCACACATA AAGGATAGTG AGTCAGCAGA GTGCACCCTG 

4101 CAGGAACAAT AGAGCCTTCC TTTTCAAGGA AGTTCTAAGA AAAATGGCAG 

4151 CAGGCAGGCC CCACTCGGGT GTATTCACTC ATTCATTTAT TCAACAAATA 

4201 TTTACTAAGT GCCCCTGTGC AAGGCTCGAG GTGTACAAAG ATGAACAGGA 

4251 GAGCTAGACT TCTTGCCATG CGTGGTGGGG TTTGCTGCCT AGTGGGAGAG 

4301 ACAGACAAAA AGCAAGGAAT GCACACACAG GATGCACACA CAGCGGCAGG 

4351 AACCAAGGTG CAGTTACCCA GGCCTGGGAT CAGACAGACA GGACTCAGAG 

4401 GAGACTTTCC CAGAGAAAAG CCATCTGAGC CAAGGGATGG ATCTGATACC 

4451 TCCGAAGGCT GAGCCACCAT AACACTCATA CCTTTAAGCC AAGTCTTATA 

4501 AACTCCCCAG GTAAGCAGCT GGCAGTCAGA AGACCTCCAG CTAATGCCCA 

4551 GGACAAGTTG ATGAGCTCTC AAGAAAAAGT TCCTGCCTTT TCTTCTCAAT 

4 601 ATCCCTGGCA CACAGTTCAG TGAATTTTGA ATGAACCAAT GAATGAAATG 

4 651 AGCAGGATAT GATAATCCCT CTCCAACACG GAATGTCCAA GCCATGCAGA 

4701 GCCGACTGGA AATTTTCCCC GTTCCCTTCC AGATGTTCCT CAGCCAAGAT 

4751 CAAAGCTGTG GTGTTTGGGC TGCTGCCTGT GCTCTCCTGG CTCCCCAAGT 

4801 ACAAGATTAA AGACTACATC ATTCCTGACC TGCTCGGTGG ACTCAGCGGG 

4851 GGATCCATCC AGGTCCCACA AGGTGAAGGG GCTCCTTCAG CCAGGCCTGG 

4901 ATTGCCACTC CCCTCACCAT TCCTCTCCTC ATCCCCACTC CATCCCTCTG 

4951 TGATCCCCAT AAGCTAGTCA TGCTGCTGAG CTTCAGTCTC GTTGTCCTCT 

5001 GCAGGCATGG CATTTGCTCT GCTGGCCAAC CTTCCTGCAG TCAATGGCCT 

5051 CTACTCCTCC TTCTTCCCCC TCCTGACCTA CTTCTTCCTG GGGGGTGTTC 

5101 ACCAGATGGT GCCAGGTAAG GCCTCTCCCC TCTGGGCAGG CAGGATGACC 

5151 CAGACCACAA GGATGGGAGG TGTGGCAAAG GGGCCTCGGG AGATTTTCCA 

5201 TCTGCATTCT CCTGGAGTTG TTCCTGGTCA GTCCTAGGGG AATGGTCACT 

5251 GTGAATGTCA TTTCCAGGTC CTCGGTGACC TTGGAGAAAC CACTGAGCCT 

5301 CTTTGAGTTC AGTTAGCATT ACCTGTTCCA TCTTCCTCCT AGGAATGAGA 

5351 GGAAGACTTA GCAGAACAAG ATATACCATA TGCTATAACA TGCTTAAACA 

5401 GATGTGAGAA ATCACCATCT AACTCCCTGG TTGGTCCCAG CCGGCCACTA 

5451 CAGGGACATT TGGACTTCTC TGGTGCTAAG TGAGATGGAG GAAAGCCTGG 

5501 TCACAAGGGC TGGTTTCTGG TTCAGGCTCT GCTTATATTT CTTATTTCTG 

5551 AGTTCATTTT CTCACGTGTC CTGTATGACA ATATTGACCA TTGGGGTAAA 

5601 AGCACCTTGA AAAGCATAGA TCATGGTTAG AGTGAGTGGT TGTTATTATT 

5651 GTGTTGGAGA AGAGCCTTGG AGGTGCAGGG ATCCATCCCC CTGGGGTCGG 

5701 GAAGCATTCC TGGGCCCCTT TCTGGTTTCC ATCGGTGTGG TTCAAACCTC 

5751 TGATTTTTGC TGGCTGGGTG GGGCACCACA GGTACCTTTG CCGTTATCAG 

5801 CATCCTGGTG GGTAACATCT GTCTGCAGCT GGCCCCAGAG TCGAAATTCC 

5851 AGGTCTTCAA CAATGCCACC AATGAGAGCT ATGTGGACAC AGCAGCCATG 

5901 GAGGCTGAGA GGCTGCACGT GTCAGCTACG CTAGCCTGCC TCACTGCCAT 

5951 CATCCAGGTG AGGGGGCAGC CCCCAACCCT GCTAGAAGGG CATCAGACCA 

6001 CCCTGCCCCT CCCTCAAAGC CTTAGCTTTG ATGCTAAATC TGATTTAGGG 

6051 GGCTGGGTGT GGAGGCTCAT GCCTGTAATC CCAGCACTTT GGGAGGCTGA 

6101 GGAGGGTGGA TCACTTGAGG TCAGGAGTTT GAGACCACCT TGACCAACGT 

6151 GATGAAACCC CATCTCTACC AAAAATACAA AAATAATCCA GGCTTGGTAG 

6201 TATGCGCCTG TAGTCCCACC TACTCAGGAG GCTGAGGCAG GAGAATCACT 

6251 TGAATCCGGG AGGCAGAGGT TGCAGTGAGC TGAGATCGCG CCACTGCACT 
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6301 CCAGCCTGGG TGACAGAGCG AGACTCCGTC TCAAAAAAAA AAAAAAAAAA 

6351 AAAAAAAAAA CCCAAGTTAG GGCTCACCTC CTCCCTCCTC CCCATCCCAG 

6401 GGCTAAAGTG AACCTTGAAA ATTAACAGTA TCTCCTCATC TGCATGTAGC 

6451 AGGACCATAC AAAAAAACAA CAGCTGTACC TGGTTAAACT GTCCTGAGCT 

6501 TTAAACCTGT AAAAGACTCA CAGCCTCTCT CCATTATCCC GTGGAGAAAC 

6551 CCAACTCTCT GCCAGCATAG TCTTGCAGAC TGCTAATTTT CTCTAACATC 

6601 CCTCACTCCG CTCCAGCCTC CTCTGCTCCA AGCCACAGCA GCAGTTGCAC 

6651 AACATAAATT GAGCTTCTGC AAATGGTTGC AAAGGATTCT GCTAGGTTTT 

6701 ATGAAGGGAA GCACAACATG ACAGAATGCA AGAGCAAAAC ACAGTCCCAG 

6751 AGAGCGCCTT TTCATTCACT CATTCATTCG GTTTTGTGCC AAGAACTAGG 

6801 CTAAACCCTG GGATACAAAG ATAAGTAAGA AAGAGGTCCA ATTCACAAGT 

6851 TGCTCACAGC CCAGCAGAGG AAGGAGCCAT GTCAACAGAT AAATTTGTAT 

6901 GCAGTGAGAT AAGCAGCAAA GTAGAGCCAT GTACAAAGAC TGTAGGGACA 

6951 CAGAGCAGAG TCACGGAGGA CCTCAAAGAG GAGGTGACAC TCCACCTCTC 

7001 TTAAAGGATG AGAACTTAAC CAGGAACAAG GTATACAGAG GATGGTCCAG 

7051 GCAGAAGGGA ACAGTGCCTA AAAACACTGA GGCCTGAGAG AGTGTGATCT 

7101 GCGCAGGCAA AGTAAGGGGC TTGGTGTGGC TGGAGGGTAG AGGGCCCAGA 

7151 AGAGGATGGA AAAGTAGGCA GGAGCCAGAC AATGAGATCT GGGGTCTGTT 

7201 CTCTGACAGC GACTTTGGGT CTGATTGGCA GTTTATAAGG ATCGTTTGGG 

7251 CTACACAATG ATGAGTGGGA GGTGGATTAG AATCAAGGCA GGGGACCTGT 

7301 TGGGAGACTC TGCAGAGGCC CAGGCAGGAA TAATGCAGGC GAAGACCAGG 

7351 TAGAGAAAGA GATGGGGCTG GACTTGAAAA GAATGTTTTA CCAGGAGCTT 

7401 GGTGATAGAC TGGATGTGGG AGGTAAGGGA GGATGACTCT CAAGTTTTTG 

7451 GTTGGGCAAC CAGGTTAATG ATGGTGTCAT TTACTGAGAG AGAAAACACT 

7 501 GGGGGAGGAC TAGACTTATT TTACAGATAA GCCAAAGCCA GAGAGGTGAT 

7551 GTGACAGAAA GGCCCATGCT CTAAAGGAGC TGAAGGTCTG ATGGCAGCCA 

7 601 TGTAGAGCAC AGTGAAGGGC AGGTGAAGGT CACAGATGGT CCAATTCCCT 

7 651 CAAGCTACTG CTACGCTAGG ACTGCACGGA GCTCCAGACC TGCGTGTGTG 

7701 TGGGGCGGGT CGTTGGAACT GCTGAACCAC ATTGGTCTTC CGCCACCAAC 

7751 CACCCTTTTC CTCCTCTCAG ATGGGTCTGG GCTTCATGCA GTTTGGCTTT 

7801 GTGGCCATCT ACCTCTCCGA GTCCTTCATC CGGGGCTTCA TGACGGCCGC 

7851 CGGCCTGCAG ATCCTGATTT CGGTGCTCAA GTACATCTTC GGACTGACCA 

7901 TCCCCTCCTA CACAGGCCCA GGGTCCATCG TCTTTGTGAG TCTGGGGATG 

7 951 CACCCCTGCC ATTGGAGCAA GGCTCCAGCA GACACATGAG GAGGATGTAC 

8001 TGTTTTAAGA TGTCGTGAGC TCCTCATTGC AAGGGCTGGC TTAGCTGTTG 

8051 TTCAGAGAGG ATTCTGAGGG GGTTTCTGTC TTGGGAGGGT CAAAGTCATG 

8101 ACTCACAGAG GTTCTTGGTA GTTAATACCT GCAGAAAAGA GCTGTACATT 

8151 CTCCGCCAGT TCCCCATTCT AGTGCCTCAA CCCCTCCCTG CCTGGAAAGT 

8201 CCTGCCTTAT GTCTAATCTC CATCCCTCCT CCTTCAGCCC AAACTCTTCT 

8251 AAAGAAAAAG AAAGCATTCC TTTTCTAGCA CAAGTTCCCC ATGTGCCTTT 

8301 TGGGAAAGGG CGGTGGGCGA CGGGACAGGG TTCCTGATCA GGGTTTTAAT 

8351 TCTGTCTTGG TGTGCCTCCA TTAGCTTTGA TGGCATCCCT TCCCTGGGTC 

8401 AGACACCCAA AGGTGGGGTA TTATGGGAAG AAGGGGTGGG AGCCTGTGAG 

8451 CATGATGCTC TTTCCCCCAG ACCTTCATTG ACATTTGCAA AAACCTCCCC 

8501 CACACCAACA TCGCCTCGCT CATCTTCGCT CTCATCAGCG GTGCCTTCCT 

8551 GGTGCTGGTG AAGGAGCTCA ATGCTCGCTA CATGCACAAG ATTCGCTTCC 

8601 CCATCCCTAC AGAGATGATT GTGGTAAGGA CCTTGTTCAG AGCTGGGATG 

8651 TTGGGGGGCC AGGCTGTGAG ACGAGGAAGC CCCTACCTTT CCTCACCCCA 

8701 TCCCCTCAAC TGGCAGCCAG TGGGACAGGA AGTCAGTTGT GAATCCATCC 

8751 CATCCCCCGT ATGTGGCGTT TCCTCTCTTT CTACTGCTCT AATAATTCCC 

8801 CCTAAGGAGG CAGGGGAGTG GGATTCAGGG TCCCCAGAGA AAAGGGAGAC 

8851 TTGAGAGAGA CGCCTGCCCT GGCCCCACCT TAGGGCCAAT CCCCATTCTC 

8901 CACTCTGGGG TTTGCAGGTG GTGGTGGCAA CAGCTATCTC CGGGGGCTGT 

8951 AAGATGCCCA AAAAGTATCA CATGCAGATC GTGGGAGAAA TCCAACGCGG 

9001 GTGAGTCCAG GTGGCCCAGA AGCCTGGCCC ACCCGCACCT CATGCCCCAC 

9051 TAAGGCCTGA GCTCGGAGAG GGAGACAAGA TGAACTCTAT GAAAGTGCAG 

9101 TCGAAACTGT ATGACACTGA CCATGTATGA ATTATTACTA TTACCGTTTC 

9151 CTGAGAAGGG CCGCACAACC AGCCAATGTA GGCTATTTTA TGAGAAATGA 

9201 GTCTTAACTG CCACACTCCC CTTATAAATC TCATTCAACT GATGCTGTTA 

9251 AACAAAGCCT CTCTGAACAG CCGCTTGCTG GCTCTTTGCC TTGCTCTAAT 

9301 GCATTGGTTC TTTGTCCATG TAGAAAGGGA ACTATTAGGT TCAACCAGAT 

9351 TCATGAAGCA TCCACTCTGT GCCAGGCACC ATGCTGGGCC CTGGGAGGAG 

9401 AGGGGTGACG CTTGTCCTGC AGGGTTGGAA CAGGCAAGGG AGGGAAGACC 
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9451 ACATAGCACC AAAGGTCTAG GGGTCTGTGG ACTCGTGAGC ATACAGGGTT 
9501 CAGAATCTGG GAGTTAACAA ACGAGGCCCT ACCACATACT GGCCCGGGGA 
9551 CCTTGGGCAA GTTAGGTTCT CTCAGCCTCA GTTTCCTCCT TTGTAAAACA 
9601 GGAGTGATGG TCCCTACCCT ATGGGGTGGT GCTGAGGATT CAGACTGGAT 
9651 GGGATAACTT AGGCAAAGAT CCCGGCACAC CATGGGGGCC TGGCTGGTCC 
9701 CTGTGGGCTG GTGAAGGACT TGGCTGCCCT CCCCACTCAC ACCCTTGGGT 
9751 TCTGCCTCCT TCCTGGCTCC TCGGCAGGTT CCCCACCCCG GTGTCGCCTG 
9801 TGGTCTCACA GTGGAAGGAC ATGATAGGCA CAGCCTTCTC CCTAGCCATC 
9851 GTGAGCTACG TCATCAACCT GGCTATGGGC CGGACCCTGG CCAACAAGCA 
9901 CGGCTACGAC GTGGATTCGA ACCAGGTAGC TCTGGCCACC CCCGGCAGGA 
9951 CTGGGCAGGA CAGGTCAACT CAGGCCTGGC ATGACATATC TTGGGTGGGG 
10001 AGATCATTGG GCTGAGGTGA GGCAGGCTGC CTCGAGTGTG GGGGATAGGG 
10051 GGTCCTCTGA CCCTAAGAGG CTGACCTCCT CTTGACTGGG AATGTGTGAC 
10101 TTTATAGCCA CTGGGTCACT CTCAGGTCTT AGGCCCACAG TCCAGCTTGC 
10151 ATGCCTGACT GCACTTGGTC CCCGTGCCCC CCAGCCCCAC ACTGGCTTCT 
10201 AATCCTGTCC CCTCCCTGCA GGAGATGATC GCTCTCGGCT GCAGCAACTT 
10251 CTTTGGCTCC TTCTTTAAAA TTCATGTCAT TTGCTGTGCG CTTTCTGTCA 
10301 CTCTGGCTGT GGATGGAGCT GGAGGAAAAT CCCAGGTGAG CCTTGTTCTA 
10351 GGGGAGTTGG GGGGAGGTGG TAAGAGAACA GTTGCCCCAA AAAAGCCTGG 
10401 GCACTGCAAG CCAGGCCAGC TCTTCTCCGA CCCCTTCTTC CCGTACTTAG 
104 51 TCTCCACTCC ACCAAAGCCA TGGATTGGAA ATAAATCAAG AGCAAAAATT 
10501 TCACACCTTC CCTCTATCCC CAACTCTTTC TCGGAATAGG TGGCCAGCCT 
10551 GTGTGTGTCT CTGGTGGTGA T GAT C AC CAT GCTGGTCCTG GGGATCTATC 
10601 TGTATCCTCT CCCTAAGGTA AGAGCCCAGC CATCGAGCAG AAGTCAACGA 
10651 AAGACTCCAA TAAGAACAAT CCCTGAGAGT TGTGTGGCAC TTTACGGACC 
10701 ACAAAGTGCC ACTGTTGTCA TACTTAGTCT CAACCACAAA CTGTGAGGTA 
10751 GACAATGCAG GTTTTATCCT CCCCATTTTA CAGGTGAAGG AAACTGAGTC 
10801 TGAGAGTCTA AGTAACCTTG TCCATAGTGA GGCAGCTTAC AGCGCAGGGC 
10851 TGGTCCCAAA CTCCAGCCTT CTGGCCTCAG AGTCTAATCC CTAGGCAACA 
10901 TTTGCACCTA CCCACGAGTA CCAGGCTCTT ATATAGCCCA GCTAGGAGGG 
10951 CTCTAGGCAT GCGTCATTTA GAGATGAGGG AAGAGAGATA GGGAAAGGAT 
11001 GGGGCCAGGA AGGACCCCAT GGCTCTAACG CCAGCACTTT CCAAACCTAA 
11051 GGTCGAATGC AGAGATTTGG GGGATCAGCC AGGGGAGGTG TTCCAGAACT 
11101 CCGTCTCTGT CCTGCCAGGC CTTGGGGTCG GGTATGCGCA GGAGGGCAAA 
11151 AAGAAGGGGA GACCCTGGGG TCCTGGAGCA ATGTTCTGCT TCTCTAGTCT 
11201 GTGCTAGGAG CCCTGATCGC TGTCAATCTC AAGAACTCCC TCAAGCAACT 
11251 CACCGACCCC TACTACCTGT GGAGGAAGAG CAAGCTGGAC TGTGTAAGTA 
11301 TCGGGCAGCC TCTGGGTACT GGCCATGCCC CTGCCCTCTC CTCCAACCCC 
11351 ACAGCCCTGT CAGCCCTGTC CTAACAATGA ACCCTCTAGT CTGCTGCTTC 
11401 CTAATTAGCA TGAGATGAGT GGTTAAAAGT CCGAGTTTCG AAGTGAAACA 
11451 TCCTATGTTC AAACCCTAAC TCAGCCATCT GCTGGCTCCA TGGCCAATAG 
11501 CAAGCCCCTT AACCTTTCCC AGTCTTGGTG TCTTAACTGG GCAAATGGTT 
11551 ATTTTATGCT CTCTGCCTCC CAGGGTTTTC TATGAAGAAG AAGCAAGGTA 
11601 ATACAAGTAA ACATGTTGTC TACATCGTAT TTTATACTCA ATAAAGCTTA 
11651 GCTATGACTA CTTTATGACA TACAGCTTTA AAAAACAAAA GGAAATAGTT 
11701 TGTATTTTAA AAAAAAACCT AGAACATAAA GCCAGAGGAC CAAAATCTTG 
11751 AGCAAGTTAC TAGACTTCCC TGGGGTTCTA TTTCCTCATC TGTAAATGGG 
11801 GGTGAGACTC ATGCAGTCAT GGTTGCGTCA AACGCTGGTT CCGAGGATTA 
11851 AATGAGATCC CAGTGGGAAA ACACCGCATG AGCGCAAACA TTCTGCAAAC 
11901 ATGACTTATT GTCCTGATTA GTCACACACT CCACCGCATC ATCCGCTGGG 
11951 CATAGTAATG AAGGCCAGTG TGTTTTGACG ACACTGCCTT CTCTCCATTT 
12001 AAGCCCCACC ATAACCTATG GGAGAGGATT TACTAAACTT TCTTAACGGT 
12051 GATGAAACCA AGGCTCAGAA TGGTTAAGTA AATTGTCAAA GGCCACAGAG 
12101 GTAGGGAGTG GTAGAGTCTG GATTAAAACT CCAAGTCCTG GACTCCAGAC 
12151 CTCTAGGCTG TACTGTCTCA TAGGGAAGGC AGTCTCACCC ACCTAGGGCA 
12201 GAGAAGAAAA TCCTTAAAGC CAGAGAAGTG AGTGGCTCAT CTGTGGTCAC 
12251 CCAGAGAGAC AGTGATGAGG ACAGGGAGAA AAATTATACC TCAGTTCCCA 
12301 GCCCAAGGAT CTGCTTTGAC CATAACCCAA CAAGCCCCCG CTATGGTGGT 
12351 ATTTCCTTAG GTTCATATGG CGGCTTTTGT TTCCATTTGA TCTTCACAGC 
12401 AATTCTCTAC AGGAATCTGG GCAGATTTAT TTCCTTTAGA GGAATTTCCA 
12451 GGTCTTAAAA TCTATAGGGG GCAACTATCA AAACTTCACC CAATGTTGCC 
12501 CCCTACCCAC ACACAAAACC AGGCCCCCAG CCGATCAGAA AGCACTGCTG 
12551 AGCTCCTGTC AGGGCCCACG CAGCTCGCTG TGAGACAGAG AGAGGGAACT 
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12 601 CACATTTATT GATCACCTAC TGAGCATCCA TCACTAGGCT AGGACCGTCA 

12651 CATTCCTTAA CTTTTGAATC CTTTCATGAG GTAGGCATTA TTATTCTCCT 

12701 TTTGTTTCAC ATAGCCATTA AAGAACAAAA TTTGGGGCTG GGTGTGCTGA 

12751 CTCACACCTG TGATCTAGCA CTTTAGGGGG CTGAGGCAGG AGGATCGCTT 

12801 GAAGTCAGGA TTTCAAGGTC AGCTTGGGCA GCTTAGCGAG AGCCGTCTCT 

12851 AGAAAAATAT AAAAGTTAGC TGGGTGTGGT GGCACGTGCC TATAGTCCTA 

12901 ACTATTCAGG AAGGTTAGGC GGGAGCACAA CTTGGGTTCC AGGGTTTGAG 

12951 GCTCCAGTGA GCTGATCTTG CCACTGCACT ACAGCCTGAG CAACAGAGCA 

13001 AGACCCTGTG ACTCCAAAAA CAAACAAACA AACACATTTT GAACCCAAAC 

13051 AGATCTGACC CAAGATGCAT GCTCTTATAG ATGCCACCTC GCTGTGTGCT 

13101 GGGGCTTCTA CTAAAAACAC AGACAAGATC AGGCAACCAC AGTCAATCTA 

13151 AGGGAAAGAG GAAAGTGTAA CCAAAGCACA AATACATAAA ATATTGCAAA 

13201 AATGCTATTT AAAGAAAAAA AAGAGAAGAG AGGCTCTGAG GTTGTACTAA 

13251 CAGAGAATGG CCTTGGCTAA TCCAGGAAGA CTTCCTGAAA GAGGTTGTTT 

13301 TTTCCCCAGG TCTGCTTTTG ACATCTCTCT TTTCACAGTG CATCTGGGTA 

13351 GTGAGCTTCC TCTCCTCCTT CTTCCTCAGC CTGCCCTATG GTGTGGCAGT 

13401 GGGTGTCGCC TTCTCCGTCC TGGTCGTGGT CTTCCAGACT CAGTTGTAAG 

13451 TGATAGCTTC CGCCCTCCTA GGCCCACAGT CGGTTCCCTG GGCCAGCCCG 

13501 CAAAGGGCTT CCATGCCACG GCCTGGCTTA GTCCACTGTA CCTTCCACCT 

13551 CTGGGCCTGG CACTGGAGGT GCTGCCAGGC CCAAAGAGAG CCCAACCCAG 

13601 CCAGGACTGT GGGCACAGTC TGGGCTGTTT GACTTCCCAT ATCTTGAAAA 

13651 CCCCAGAGAA AGCCAGCATA CTCTTGCTGG GGATGGCTGG GGAGAGGGCA 

13701 GTGGCAGAGA AAGGAGGGCA AGGGCAGGTG GTGAGATTCA ACATCCTTCC 

13751 AAAGACATTG CCAGAACCCC AAACCAAATG GGACCCCACC CCAGGAGAGC 

13801 GCCAGGGTGG AAGACAGAAG CTGTGTTCTA CACACTGGGA GTATTACAGA 

13851 GAAGGGGTCT TGGCCAAGGC AGGGAGTACG CTGAATGTTG GGGGAATCCT 

13901 ATCTTCTCTT CTTGAGAACT CAGAACAAGG AAATGATGAC TTCAGGGCGA 

13951 CTCCCACCAC TTCTCCCACC ACTTCTCTCC CCTGCCCTGT GGTCTGGGAG 

14001 CTATGTCAAG GACCTGCCTG TCATCCTCAT AGTTATAGGA GGCCACAGGC 

14051 CACCAGACAT GTGTCTCCAG TGCAAAAAGA CAGACACAGC AAGTCTGGGG 

14101 GTGAGGACAG GACCCCATCC TACCTTGGCT CTGCCCCCGC CCCAGCAGGG 

14151 GCACCCTTCC AGGCCCATGT GCCATTAGCA TTCTCTTATG TTTTTCTCTT 

14201 CCTGCTTCAT CCAGTCGAAA TGGCTATGCA CTGGCCCAGG TCATGGACAC 

14251 TGACATTTAT GTGAATCCCA AGACCTATAA TAGGGTAGGT AATTCAAGCT 

14 301 TATGACCTCC TTCTTTTGCT CTGCACCACC CCAAGAAGAG GTTGCTTTTT 

14351 AAAGCCAATA AAGACATTTC TGCAACTTGA GCTCAGTCTC CCTGTCACAG 

14 401 GCCCAGGATA TCCAGGGGAT TAAAATCATC ACGTACTGCT CCCCTCTCTA 

14451 CTTTGCCAAC TCAGAGATCT TCAGGCAAAA GGTCATCGCC AAGGTAAGGC 

14501 TCAGTCCCTG GCGACCAGAG GCTCTGGACA GAGAGTGGCC GGAAAATGGA 

14551 AGCAGAAGGG CGGTGGGAGC TGAGAATAGG CCACTCCCAT AGAGGGTGGA 

14 601 GGTCAAGATT GCTGTTGGCT CTCTCCCTGC AGACAGGCAT GGACCCCCAG 

14 651 AAAGTATTAC TAGCCAAGCA AAAATACCTC AAGAAGCAGG AGAAGCGGAG 

147 01 AATGAGGCCC ACACAACAGA GGAGGTCTCT ATTCATGAAA ACCAAGGTGA 

14751 ATGAAGGCCA GAAGCAGCCC CGTGCCCTGC TCTCCTGCCC ATTCTGATAC 

14 801 TGCCCCCTGT TACTCATGGT ACCCTGGGGG CCCCGCTTCC CACCCTGACA 

14 851 GGCAAAGACA GAAAGTCTCT GGGAACACTG CCTGGTGGCC GCTGGGCATT 

14 901 TTTCTTCTTT TTTTTCTTTT TCTTTTTAGA GATGGAATTT TGCTCTTGTC 

14 951 ACCCAGGCTT GAGTGCAATG GCGTTATCTT GGCTCACTGC AACCTCCACC 

15001 TCTGGGGTTC AAGCGATTCT CCTGCCTTAG CCTCCCAAGT CGCTGAGATT 

15051 ACAGGTGCCA CCACACCCAG CTAATTTTTG TATTTTTAGT AGATATTGGG 

15101 TTTCACCATG TTGGCCAGGC TGGTGTCAAA CTCCTGACCT CAGGTGATCC 

15151 ACCTACCTTA GCCTTCCAAA GTGCTGGGAT TACAAGCCTG AGCCACTGCG 

15201 CCCAGCCTGG GCATTTTTCT TCTTGGATGA GGTGCTACCA TCTCCCAGGG 

15251 AAGCCACTGA ACCCCCAAGG CCCTTCTCCA TTTTCTGGCT AAGATAGGAC 

15301 ATGGCCCATG GACTTTTGAA CAACCCAGAG GGGGAACAGC AGTGAATTTC 

15351 CTGGGGAACC CAGGCAGCCC AGGGCTAGCA AGGCTGGGGT GGCCATGGCA 

15401 GTAATCCTTG TAATCCCAGC ACTTTAGGAG GCCGAGATGG GAGAATCACT 

154 51 CTCATGAGTT CAGGAGTTCG AGACCAGCCT GCCCAACGTG GCGAAACGCT 

15501 GTCTCTACTA AAAATACACA AAAATTAGCC AGGCGTGGTG GTGGGCACCT 

15551 GTAATCCCAG CTACTCAGGA GGCTGAGGCA CGAGAATCAC TTGAACCCGG 

15601 GAGGCAGAGG TTGCAGTGAG CCGAGATAGT GCCACTGCAC TCCAGCCTAG 

15651 GCAACAGAGG GAGACTCTGT CTCAAGAAAT AAAGGAGCTC AGTGTCCCCG 

15701 GAGGGGCTTT CTCCCAGAGA GAGTGGGCTT GAGGCTTCAG TGCCTCTCTT 
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15751 GGCTGGGTCC TCTGACTTTG TCTGGGTTGT AGGAGACCAA GTTTGCAGGC 

15801 CCTGCCTAAG AAAGGGCTTT GGGAGAGGCC TCTCTGGTGG AGCTTTCAGG 

15851 GTCTGTGTTC ACCATCACCG AGGCGAGTTA TTCCCCTACA CCTACACCCT 

15901 CCATGCCCCT GCTTCAGTCA CAGCAAGGTC TGGCTCAGTC TGGTGGTCCC 

15951 TGACTCTGCC CACTGTCCCC ACCCTTCCAG ACTGTCTCCC TGCAGGAGCT 

16001 GCAGCAGGAC TTTGAGAATG CGGCCCCCAC CGACCCCAAC AACAACCAGA 

16051 CCCCGGCTAA CGGCACCAGC GTGTCCTATA TCACCTTCAG CCCTGACAGC 

16101 TCCTCACCTG CCCAGAGTGA GCCACCAGCC TCCGCTGAGG CCCCCGGCGA 

16151 GCCCAGTGAC ATGCTGGCCA GCGTCCCACC CTTCGTCACC TTCCACACCC 

16201 TCATCCTGGA CATGAGTGGA GTCAGCTTCG TGGACTTGAT GGGCATCAAG 

16251 GCCCTGGCCA AGGTGAGGCC CTCGGGGACA GCAAGCACCA CCCACTCCAC 

16301 CCCCTCCGCT CTGCTCTCCA CATTCCCTTT CCTGGGAGCC CTCATTTCAG 

16351 GAAGCTGAGG GAGGAAGCTC ACTGGGGAGA CTAACAGCTC CTAGGAATCC 

16401 CTCCTTTCCC CAGACGCCAC CAGGTTGAGA CATTCTCCAC AGAGCAGGCC 

16451 CAGACGGCCC ATGACAATGA GTGGCGGGAC AAGTCTACCA GAGTTTCAGG 

16501 CCCCTGTGCT CCCAACACCC CCAGCAGTGG CCATCCCAAG TCCCTCTCAG 

16551 CCATCAGGAA CCCACCCAGG TTCTCTGAGG AGGGTCCAGT TTGGCTCCTG 

16601 GTTCATGATC TGCTGCCCTT GTCCCTCATT CACCAGCCAC CCTAGGACAG 

16651 GAGAAGAAAT AATACCAGTG CCCCACACCA TCAGGCCAAA CAGAGAGCCC 

16701 ACGGGACACC TTGAATGAAT GTATCCATCT GATAACTTTC CAGCAGCCAC 

16751 CGCCAATGGC GGGAGTCAGC AAACCTCAGA GCTGGCTCAG ATAGAGGCAA 

16801 GCCAGGGGAA CAATGGGCAC AGAGAGTGTT CGGACTGCCT TCACCATCAA 

16851 CCAGGCGCAG GGCAGGCCCC ATACCCAGCC TTGGGCCTCA GCCGGCTTCC 

16901 TTAGCCAGGA TCTGGAGTCC AGGCCAGCCT TGGCTGAAGC TCTAGACTCC 

16951 CTGAGCCTCC ATCCTCCCCT GCAGCTTCTG TCTGAAGCCA CAAAGAAGTC 

17001 TGAGAATCTA AGCTACTGAA AGAAAAGATC AGCCGGGCGT GGTGGCTCAC 

17051 TCCTGTAATC CCAGCACTTT GGGAGGCCAA GGCAGGTGGA TCACAAGGTC 

17101 AGGAGTTCAA GACCAGCCTG GCCAACATGG TGAAACCCCG CCTCTACTAA 

17151 AAATACAAAA ATTAGCCAGG TGTGGTGACG GGCCCCTGTA GTCCCAGCTA 

17201 CTCGGTAGGC TGAGGCAGAG AATTGCTTGA ACCCAGGAGG CGGAGGTTGC 

17251 AGTGAGCCAA GATCGCGCCA CTGCACTCCA GCCTGGGCAA CAGAGTGAAA 

17301 CTCCATCTCA AAAGAAAAAA AAAGAAAATA TCTAGCCCCA CAAGAAGGGG 

17351 CCATGGTGAC TTTAAGTGCC CGCCACGTTG GCAAAAGTCC ATTTCCGCTC 

17401 CACTTCCCAG AGAAACCGTC AGCCAACACT CCAGGGAGAA GTGGTGTGCT 

17451 TTGCTGCTAT TTTTGTCTTT GGCTGCTGGG CTCTCAGGGT TGCTTATTTG 

17501 TTTGGCTTCC CCTCTGAAGT ACGTTTTGTG AATCACTTTT GAGACCCACT 

17551 CAGAACATTC CTTTCCTTTT GCCTCCCTAC CCCAACAACA CTTCTAGCTG 

17 601 AGCTCCACCT ATGGGAAGAT CGGCGTGAAG GTCTTCTTGG TGAACATCCA 

17651 TGGTAAGAGA AAGAGGACAT TTAGGGACTG AAAGACTGGC AAGGAGTGTG 

177 01 GGGTAGGAAC AGGTTGGTGG GGTCTGAATA GTGAGGAGGT TGGAAACGAG 

17751 AGCACCCAGC TATCCCCCAC AAGCTGCTGC CTGCTCATAA AAGCTTCAGG 

17801 TACAAGTCCA AAGAGACTGG TCAGATTGCA TAAACATCCT AGGGGCCTTA 

17851 GTGACAGAGT GGGGGTGAGG AGGTCATGGA GTTACAGAAG GACAGCTAGG 

17901 ATTCTAATCT ACCCCATAAC TAATTTGCCA CGTATCCTTG GCCGAGTCAC 

17951 TTTATCTCTC AAGGGATCTA TTTCTACCTA TGTAAAACGA GAGGGTTGAC 

18001 TAGATGGATT TGGGGATCCT CTCCCAATCA GAAACTCTGT GAATCGATAT 

18051 AGGCATAGAG CACACGGTAC CCTAATTCCC CAGGGAACAT AT AAAT AT GC 

18101 AGTTTTGTAG GCATACAGCC TCCAAAGGGT GCATATACAC AGCCTCAAGG 

18151 ACGTGGCCAC AGGGCAGCAG ACATTTACAT GACTAGCATG TACGCAAAGT 

18201 GCAGAGATGT GGGAGCAAGT GCACACAGAC ACACAGGAGA ATGTGAAGGG 

18251 GCACATACAC ACACACCCAG CTCCCTGCAC TGGGTCAGAC CCCCTCCAGC 

18301 AGGGCTGCAG TTCCCAAGCT CCGCATGGCC ACGTTCGGGG AGAGAATCTG 

18351 CAGTGGCAAT GACCTGCTAT GATATGTTCT GGAGTTAGAA GCAGTGGATT 

184 01 CTCCCCAACC TCACTGGACA CCCCCTTAGG AAACCATCTC TAGGATTAAG 

184 51 AGTAATCCAC ACAAACTTCC AATGCCACAC ATTGGAAGTT GCTGGAAAGG 

18501 TCTGGGAAAA CAAGAGGAAG GATGGGTCCT TGGGGGATAG AACTGGCAGC 

18551 GGCCTCTTCA AGGATGGCTT AGGCTTTTCC ACTCGAATCA CCACAAAGTA 

18601 CTGACTCCCT AAATCAAACT GCTTCCTTCT GCTCTGGGTT GAAACTTCAG 

18651 CATCCTCAAG TTCATGTTGC CCTCTGCCGT CCAGAACTGA TATTGCACTG 

18701 CCAATGCCAT GGCCCTCAGA TACAGCAAGA GCTGGGACCT CAGGCCTCTC 

18751 CCATCCCTGC TCTGGTCTCA CTATCTTCCC CACCCCCAGC TCCAATCCAC 

18801 AATGGCTGTT ATCTTTCTGA AGGTGATCTT TTCTCCTTCT AGCCCAGGTG 

18851 TACAATGACA TTAGCCATGG AGGCGTCTTT GAGGATGGGA GTCTAGAATG 
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CAAGCACGTC 
ATGCTAGAGA 
CACCTGGGGA 
CATTATTAGA 
CCATTTCAGT 
AACTAACATT 
TAAATGATAA 
AAAAATTCTT 
TAGTAGAGCA 
TCTTTCTAAA 
CATTTTAACT 
GATTGGGTCT 
AGTAAGTCAT 
ACTGGGGAGT 
CAGTCATCTC 
GCCTCCCAAG 
CCCTCTGTGG 
CTCTTTGCCT 
GAAAGACTGC 
AGAGGGAAAC 
CCGCAGAGCA 
CCCTAATCAT 
AGCACTTATA 
GACCATTCAC 
CATTGTGAGG 
AGATTAGAAA 
GCTAGCCAAT 
GCCCACAAAG 
AAGACCACAG 
GGATAAGTCA 
CCCTTTTCCT 
ATTCTGCACT 
CTTGTACGAC 
TGAGCTGAGG 
GATGGGGTCG 
GCCCAGAGCA 
TTTTTTCTTT 
TCACTCTATC 
AACCTCTGCC 
AGCTGGGACT 
ACAGCTTTTC 
CAATCTGCCT 
CGATGCCTGG 
GGGAGAGGGT 
TATTCTCCAA 
GGGCCCTAGT 
TGTCCCATCC 
AATTCAGCAG 
GATGGGCACA 
TACGTTATGG 
NNNNNNNNNN 
TGTGGACAAG 
ACACCTGTAA 
GGTCAGGAGT 
CTAAAAGCAC 
GCTACTCGGA 
GTTGCAGTGA 
CTAGACTGTC 
GATATGAGGG 
CCAATGCCCT 
GCTTTGGGTC 
TTTTGACATT 
TGGCAGGTGG 



TTTCCCAGCA 
CGTGACCCCA 
ATCCTAGGCT 
AAGAACACAG 
TTTCTGAATC 
ACCTACCTCT 
TTACATCTTT 
TATCAATATA 
GGCCCACTCT 
GGGGGCTTTC 
TACCGGAGCT 
GAGTTTTGTC 
ACAAATGGGA 
CAGAAAAATC 
TAGCCCTCAG 
ACTCTTGTTG 
GTCATAAGAA 
TAAGAGACTG 
CCAGGAACAC 
CCTGGGGAAA 
AAGAGGGAGA 
GTTCAGCTAC 
AATAATTTAT 
AGATACAAAT 
GACAAAGCAG 
ATATATTCCC 
AGTTTTTCTA 
GAGGGAAGAC 
CCGTGCTGTT 
TGTGTCAGGA 
GCCACCATGC 
GGCTTCCTCC 
TCAGAGGAGG 
GAAGGGGCTG 
CTGTCAAATA 
GTCACATTCA 
TCCTTTTCTT 
ACCCAGACTG 
TCCCGGGTTC 
TCAGGCCTGC 
ACCATGTTGG 
GCCTCGGTCT 
CCTTTGTTTT 
GAGTCACTGG 
CACATCACCC 
TTTTATCATC 
ACCACACAAG 
GTATTTATGG 
TAGCAGTGAG 
TAGAATAGAT 
NNNNNNNNNN 
CCTTAAGAAC 
TCCCAGCACT 
TTGAGACCAG 
AAAAATTAGC 
AGTCTGAGGC 
GCCGAGATCG 
TCAAAAACAA 
ATCACCATCC 
GAATTAAAAG 
TATGAGAAGT 
GAGCTCCATG 
GACAGATTAG 



TACATGACGC 
GGACACAACT 
CCAAGGCACT 
GAGAAGGTTT 
TGTTTCCTTA 
CTGCATTTTT 
TGTGTCACTT 
AGTTTTCTGG 
GCTGGCATCG 
GGGAAGGGAG 
TAGAGATTTC 
TCAGCTTTTT 
TACAAATAAC 
CTACCTCCTT 
GGTCCCCTCC 
TGCCCCAGCC 
AGGCTTTTCT 
ATGAAGGTGA 
ACAGGGCGCT 
CAACGGGCTT 
TTCTGGAGGA 
TCTAGTTGGT 
ATTTTATATA 
CACACACATA 
GCAAAGTGAG 
AGGAGGACAG 
AGCTGAGTAA 
CGCGATACAC 
TTTGTGAGGC 
AGCAGCGTCC 
ACCAACTTTG 
CTTCCAGGCT 
ACATTCGCAG 
TGAGGGTGGG 
CAAGGCGTTC 
AGGCCACAAA 

GAATGCAGTG 
CAGAGGTTCT 
GCCCAGCTAA 
CTGGGCTGGT 
CCTAAGTGCC 
CATTCTTCTC 
ACCAAGTCCT 
ACGTGACCCT 
TGTGAAATTA 
ATTGGCAGGG 
CGTACCTACT 
CAAAACAAAG 
GGATTTNNNN 
NNNNNNGTCT 
TAAAAAATAT 
TTGGGAGGCT 
CCTGGCCAAC 
CAGGCGTAGT 
ATGAGAATCA 
TGCCACTGCA 
ACAAACAAAA 
CCATAGGGCC 
AAACCAGATG 
AGTGTCTCTC 
GTGCTCTGAA 
AAAATAGAGC 



AGTCCTCTTT 
TCCAAGGGGT 
GAAATAGCAG 
AAGTTCCAAT 
TCTATAGAAT 
CTTTTATTTT 
GAAAGCACTT 
TTGCACAAAC 
TTCCCTGCCT 
GGGAGGGGAG 
AGGCTGGTGA 
GACATTTAAT 
AC CAT CT AAA 
GGGGTCCCTG 
CAGCTCAGCT 
CTGGGTAAAA 
GGCCCTAGAG 
AACCATCTGT 
GGCTCCTGCC 
TCCTGCTTCG 
AGCTGCATTA 
ATGTATACTT 
ATATATACTT 
AACACACACC 
GCTGGTTATC 
GAATTCCCCA 
AACCTTCCCT 
ACCTGTCTGG 
AGGTAAGGGA 
AACCAGAGCC 
CTGTTCAGTC 
CCAGGGGATG 
CTACTGGGAC 
AGCAGGGCGA 
ACTCAGCTGT 
GATTTGTGGT 
TTTTAATTTG 
GCATGATCTC 
CCTGCCTCAG 
TTTTTGTATT 
CTCGAACTTC 
TGGATTACAG 
ACTCCCTGAA 
AGAGAACCAG 
GAGCAAGCCA 
GGGGAAACAT 
CAGTCACTTG 
GTTTGCCTGA 
GCCTCTGCCT 
NNNNNNNNNN 
ACAAATGAAT 
GTGGCTGGGT 
GAGGTGGGCG 
ATGGCGAAAC 
GGTGCATGCC 
CTTGAACCTG 
CTCCAGCTTG 
CAAAACCTAA 
CCTGGATTAA 
ACTAGGTTTG 
TTTGTGCCTC 
TCCGTCTCTC 
TGGAGCCACA 



GCCCAGGCAA 

AAGGTTCTTG 

GACCAAGAGG 

ATCAAGTCTG 

GAGCACCATC 

GTTTTAGGGT 

TGTGTATTGT 

ACCCAAAGCA 

CCTCCTCATC 

TAAGCCTACC 

GGGATAAAGA 

TTACTAGCTC 

ACTCCAGAAG 

CCCAGATCCC 

CCTGCCCTTG 

ACCTCCCCTG 

CAATGATTTG 

TCTAAGTGCT 

CTCCATGCCT 

TGAAATTTGT 

GTTGTTAGTG 

GATTAGTCAT , 

ACATATTATA 

TTTTCAACAG 

AGACTTTAAC 

AGGTCAGGCA 

GCCTCTAACG 

TATAAGGGGG 

AGGGGCAAGA 

GGCCACCTGT 

ACTGAAGCTC 

CTGAGCTCTC 

TTAGAGCAGG 

AGAGGGGAAG 

CTCACCTCCA 

CATCTTTGTT 

AGACAAAGTC 

AGCTCACTGC 

CCTCCCGAGT 

TTTAGTAGAG 

CGATCTCAAG 

GCATAAGCCA 

AGGCATCGTG 

TATCTATTCT 

CATACACCCT 

AGGTAATACC 

TTCTTTCATT 

CACAGTTCAG 

TTTAGAAACT 

NNNNNNNNNN 

TATTATTGCA 

GCAATGGTTC 

GACCACCTGA 

CCCGTCTCTA 

TGTAGTCCCA 

GGAGGCAGAT 

GGTGACAGAG 

AAGATATGTG 

CACCACCCCA 

GAGAAATCTG 

TTCCCATTCT 

ACAGTGCTGA 

GAGATTTGGC 
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22051 AGACTGATTT CGGTGCCCTC 
22101 GGATAGGACC AAAATAGCTT 
22151 CTACTGCCAA CCCTCATTTT 
22201 GCGAGGGACA CGTGGAAGGT 
22251 AGATTTCAGC ACCTGCTCCT 
22301 GACCCTGCCT CTCCCCTGTT 
22351 CACACACACA CACAGGTGCT 
22401 TCTTGGTTGC AGCCACTGTG 
22451 TCCCACTTCC TTCCTGTGGT 
22501 ACGCAGAGAC CCTGACCGCC 
22551 CTACAGAGTG CCTGGCACTT 
22601 ACAGGGGGTG TCGGGCGGAG 
22651 CTCTCCTCTC CCCCTCTCTC 
22701 GAGAGCCTCT CAGCAGCAGG 
22751 GGTGAGCCCA CTCTTCACCC 
22801 CTGCTCACTC CACCCCACCC 
22851 CACCTTGACT TCCAGCTTTT 
22901 CAACTGTGCT GGCCTGCTGT 
22951 ACGGCTCGGT GTCAGATGGT 
23001 TGGTTAGGAC TCTGGCTTGC 
23051 CAGCTCTGCA CCCAGGGACC 
23101 TTGGTAACTA TGCGCCCCCC 
23151 CACTGCTGTC AGCAAGCACA 
23201 CGTCGAGACC ACCAGAGCTA 
23251 CCTCAGCCTC CTGAAACATC 
23301 GAGGCTGGCT AGAAACCCCA 
23351 CTGGCATCCT AGTAATAGAT 
234 01 TTCCTCTTTG CACCCAGAGA 
23451 ACAGGGAAAA CGGATGGGAC 
23501 GGCCCATCAG GGAAATGCCT 
23551 ACCTGGCAAA GAGCCAATCA 
23601 AGAGACACAA TGGGGTCCTT 
23651 TCCGCTACAT CCCCTAACTG 
23701 TTGGGTCTTC CTGCACAGGA 
23751 CATCTTCTCC TATGAAGATG 
23801 AGGAGGTATT GAGAGGAACC 
23851 ATGTGTAAGT GTTTTGTAAA 
23901 GGCCTCTCAT CACTGCTTTT 
23951 CCTAACAGGT TGGGCTGGCC 
24 001 TCAGCCAGAC CTGCGTTGTG 
24051 ACAAGTCTTC TTAGAGTTAA 
24101 TTCTTGGCCC AGGAGCATTG 
24151 TGCCAATGGA ACTCAAAACT 
24201 TCACCGTGCA CAGCACCCGA 
24251 AGCCAATGAC AACTCACTCT 
24301 CAGGCCCAGG AGGTGGGAGA 
24351 AGAGTCCCCT TTGAACCAAG 
24401 GATCCTTAAC CCCCAAGTAA 
24451 AAGGAGATTG TTGACCCAGT 
24501 GCCACATGTC TGAGGGTTGC 



TTGGAATCTC CAGCACATTC CAAAAAGCCT 
ATCAACGTGA GAAAGGACTT CAGAGCTTGT 
ACCCAATGAG GAAAGTGAAG CTATTAGGGG 
CACACAGCAC ACAGGAGGTG ATTCACATGT 
GCCACGCTGG ACTGGTTCAC CTCCTAGGCT 
CACACACACT CTCGCACACA CACACACACA 
TTGTTCTGGC CAGGGGTTCC TAGGGTCACC 
ACCCCAACTG GTCTAACCTC TCTCTTCCCC 
TCCTGCAGGA GATGTTCGGG AGCATGTTTC 
CTGTGAGGGC TCAGCCAGTC CTCATGCTGC 
GGGACTTCCA TAAAGGATGA GCCTGGGGTC 
GAAAGTGCAT CCCCCAGAGC TTGGGTTCCT 
CTCCCTTCCT TCCCTCCCCG CATCTCCAGA 
GGGGTGCTAC CCTTACAGGA GTGAGAGTCT 
GTCAGGCCCT GGCCGCAATG GACAAGCCTC 
ACCTCTGCCC TGTCCTTGGC AGCTGAAGGA 
ACGAGTGAGC CAAAAACAGA AGGACAAGTA 
ACAAGCTTCA AAAAGTGTCC CAGAGCCCAC 
GTCAGGCTGT CACGGACATA GGGATAAACT 
CTTCCCCAGC TGCCTCAACT CTGTCTCTGG 
ATGTGCTCTC CACACCCAGG AGTCTAGGCC 
GTCCATCATC CCCAAGGCTG CCCAAACCAC 
TCAGACTCTA GCCTGGACAG TGGCCAGGAC 
CCTCCCCGGG GACAGCCCAC TAAGGTTCTG 
ACTGCCCTCA GAGGCTGCTC CCTTCCCCTG 
AAGAGGGGGA TGGGTAGCTG GCAGAATCAT 
ACCAGTTATT CTGCACAAAA CTTTTGGGAA 
CTCAGAGGGG AAGAGGGTGC TAGTACCAAC 
CTGGGCCCAG ACAGTCCCCC TTGACCCCAG 
CCCTTTGGTA AATCTGCCTT ATCCTTCTTT 
TGTTAACTCT TCCTTATCAG CCTGTGGCCC 
CTGTAGGCAA AGGTGGAAGT CCTCCAGGGA 
CATGCAGATG TGGAAAGGGG CTGATCCAGA 
AGACTCTTTA ACACCCTTAG GACCTCAGGC 
AAAATAGGGG TTAAGTTTTC CATATGTACA 
CTACTGTTGA CTTGAAAATA AATAGGTTCC 
ATTTCAGTGG AAATGCACAG AAAATCTTCT 
CTCAAGCTTC TTCAGCTTAA CAACCCCTTC 
CAGCCTAGGA AAACATCCCC ATTTCTAACT 
TGTCTGTGTG TTGAGTGAGC TGGTCAGCTA 
AGGAGGGGGT GCTGGCCAAG AGCCAACACA 
CTTTTCTGTG AATTCATTAT GCCATCTGGC 
TGGAAGGCGA AGGACAATGT TATCTGGGAT 
AGTGCCAAAT TCCAGGAGGA CAAGAGCCTT 
CCCCTACTCC ACCTCCTTCC AAGTCCAGCT 
AGGTCACAGA GCCTCAGGAA TTTCCAAGTC 
TATCTAGATC CCCTGAGGAC TTGATGAAGT 
TCATTAACCC CCAGACCAGC CTCAGAACTG 
GACCTGGAGT TGAGGCTCAG GGAGAGATCT 
AGAGCC 



FEATURES: 






Start: 


1997 




Exon: 


1997- 


2121 


Intron: 


2122- 


4732 


Exon: 


4733- 


4872 


Intron: 


4873- 


5004 


Exon: 


5005- 


5115 


Intron: 


5116- 


5781 


Exon: 


5782- 


5957 


Intron: 


5958- 


7770 


Exon : 


7771- 


7935 


Intron: 


7936- 


8470 



FIGURE 3, page 8 of 20 



Exon: 


8471- 


8623 


Intron : 


8624- 


8917 


Exon: 


8918- 


9000 


Intron: 


9001- 


9777 


Exon: 


9778- 


9925 


Intron: 


9926- 


10221 


Exon: 


10222 


-10335 


Intron: 


10336 


-10539 


Exon: 


10540 


-10617 


Intron: 


10618 


-11197 


Exon: 


11198 


-11293 


Intron: 


11294 


-13338 


Exon: 


13339 


-13445 


Intron: 


13446 


-14214 


Exon: 


14215 


-14284 


Intron: 


14285 


-14400 


Exon: 


14401 


-14493 


Intron: 


14494 


-15980 


Exon: 


15981 


-16262 


Intron: 


16263 


-17597 


Exon: 


17598 


-17652 


Intron: 


17653 


-18842 


Exon: 


18843 


-18988 


Intron: 


18989 


-20477 


Exon: 


20478 


-20549 


Intron: 


20550 


-22478 


Exon: 


22479 


-22523 


Stop: 


22524 





CHROMOSOME MAP POSITION: 

Chromosome 1 



ALLELIC VARIANTS (SNPs) 

DNA 



Position 


Major 


Minor 


Domain 




48 


C 


G 


Beyond 


0RF<5') 


132 


G 


A 


Beyond 


0RF<5') 


724 


A 


C 


Beyond 


ORF<5') 


1558 


C 


G 


Beyond 


0RF(5') 


1577 


A 


G 


Beyond 


0RF(5 ? ) 


2487 


C 


A 


Intron 




2634 


T 


C 


Intron 




4352 


A 


G 


Intron 




5157 


A 


C 


Intron 




5658 


A 


T 


Intron 




5945 


T 


C 


Exon 




6281 


C 


T 


Intron 




6452 


G 


C 


Intron 




6610 


T 


G 


Intron 




7247 


T 


C 


Intron 




7360 


A 


G 


Intron 




7644 


A 


T 


Intron 




8127 


A 


C 


Intron 




8317 


G 


A 


Intron 




9079 


G 


A 


Intron 




9537 


G 


T 


Intron 




12302 


C 


G 


Intron 




12354 


C 


T 


Intron 




12487 


c 


T 


Intron 




13198 




A 


Intron 




13257 


A 


G 


Intron 




14541 


G 


A 


Intron 





Protein 

Position Major 



Minor 



180 



FIGURE 3, page 9 of 20 



14545 


A 


G 


Intron 


15041 


C 


A 


Intron 


15053 


A 


C 


Intron 


15065 


A 


G 


Intron 


15108 


A 


C 


Intron 


16274 


- 


G 


Intron 


17424 


C 


T 


Intron 


17627 


G 


A 


Exon 


18427 


T 


C 


Intron 


18813 


C 


G 


Intron 


19035 


T 


C 


Intron 


19182 


T 


C 


Intron 


19508 


- 


G C 


Intron 


19571 


T 


G C 


Intron 


20147 


T 


G 


Intron 


20180 


G 


A 


Intron 


20584 


A 


T 


Intron 


20717 


T 


C 


Intron 


20894 


A 


G 


Intron 


21787 




A C 


Intron 


22264 


T 


C 


Intron 


22338 




C A 


Intron 


23363 


T 


C 


Beyond ORF(3') 


23688 


G 


A 


Beyond 0RF(3 ? ) 


24210 


A 


C 


Beyond 0RF(3') 



657 



Context : 



DNA 

Position 
48 



CTGGGTTCCTATGTGGGGAGGTCATGCTCCCCACTCATTGAGCCCCC 
[C,G] 

CAGGCAAACCACCTGGACAGCCAGACCCATGCAGACTCTGGAGCAGGTGGAGAGGAAGAG 
TGAGACCACCCCGCCTCACGGGCGGTGAAGGGCCGGCAGCCTCTGAATAGTCTCTGCTAG 
GAGGTAGAAAGCACCCTCCCATCTTAATCATAGTAATCATCGCCACTACCATTTACTGGG 
TGCCTATA7W\GGCCAGCCTCTTCATACACATGATCTCACTGAATCCTCATAGCATCTGC 
CTGCGACTGTTATTATCCCCATTTACAGATGAAGAAACTGAATCTTTGAACCCAGGTCAT 



132 CTGGGTTCCTATGTGGGGAGGTCATGCTCCCCACTCATTGAGCCCCCCCAGGCAAACCAC 
CTGGACAGCCAGACCCATGCAGACTCTGGAGCAGGTGGAGAGGAAGAGTGAGACCACCCC 
GCCTCACGGGC 
[G, A] 

GTGAAGGGCCGGCAGCCTCTGAATAGTCTCTGCTAGGAGGTAGAAAGCACCCTCCCATCT 
TAATCATAGTAATCATCGCCACTACCATTTACTGGGTGCCTATAAAAGGCCAGCCTCTTC 
ATACACATGATCTCACTGAATCCTCATAGCATCTGCCTGCGACTGTTATTATCCCCATTT 
ACAGATGAAGAAACTGAATCTTTGAACCCAGGTCATCTGGCTCTCAAACTTGTGCTGTTT 
TCCCTAAGCCACCCGGTCTCTCATTTCTCCCACTGAAATGTCTCACATGCCATTGCCCTT 



724 



ATTGCCCTTACTCATTTCTGCCCATGTCTCCTCCAAAACACCATTTATCAATTCGCTCAA 
CAAGTATGTGTTGAGTACACACTAAGGGCCAGGCGAGGGGCTGGGCACAGGCGCTGGGGG 
TAGGTTCATTCTCCCACCTTCGCTTCTGCTGGGTATCACCTGTGGGGTCTTGCCGGGCAT 
CCCACCCTCACCTGTAGTTCAAGTGGACCTTGGGATCCCAAGACCAAATGAATGGAATGC 
ACCAGCCCAGCCTTCACCAACTTGAGCACAATCTTATTCATAATAGAAACTCACATTTGC 
[A,C] 

TCACACTTTACATTTTACACAACCCCTTCTTATCCATTAACTCATTTGATCTTCACAACA 
ACCCTGTGAGATATGTCTGTTACTCCCACTTTAGTGATACAGAATCTGAGGTTTGAAAAG 
TAATGCTGACCATTCTGCCTCATTAATAAAAGCAGGATTAACCCAGGCTCCTGGACCCTT 
CCACAAAAGGCATTAAGCAACCTGCTCCCCTCTGACMCCTCCCCTGTCACCCAGGCTCT 
CCTCTGGGAAGTTGGGGGCATCTCTAGCCCCCAAGTAGTTACTCATTTTCAACCCCATCT 



1558 TCAGCTCTGCCCATCTCAGCTCCTGGAACGTCAGCCAGGTTGCGCAAAAAGTGAGGAGGA 
GAGGAGCGGCAGTACACAAGGGTGGGGGAAAGATTAGGCACAGGAAGCCGTGGGAGAGAG 
AGCCGGCAGGTGGACCATCCTGGTTTCCCCACACACACCATTGTCCCCCTGGGAAACCTG 
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TTGGTGAAGTTCTAGATGTCTTATCCAAGAAGGGTCCTCTTGAGGTCATCTCAGCTATCC 
CCCTGCCTCTAGGCAAGCTGTTTTCTGTTTCTTCCAAGCTGACTGGCTGAATGGTAGGAG 
[C,G] 

CTTTCTGCCAGGGAAACTAAGGTCTGGGAAGGGAGTATGGCTTGTGGGGACACCAGGGGT 
CAGGGGAGGGGAGGGTCCACCTGCTG/^ATCAAGTGGGGCCTCCTGCCCTCGTGATTCCCC 
TTTGCCTGGTGCTCAGTGGGGGTGATGGTGACGCCACAGGTGTGGAGTGCCAGCCACGTG 
CTGAGCGCCAAGCAAAACAGCCAGGGTGAGTCTATGCATCATCAGTGCCTGGGAAGGAAG 
GCCACTGCGAGCAGGGAGTCTGACGGAAAAACTTGACAGAGGGAAGGGAGGCACCTTGCT 

1577 CTCCTGGAACGTCAGCCAGGTTGCGCAAAAAGTGAGGAGGAGAGGAGCGGCAGTACACAA 
GGGTGGGGGAAAGATTAGGCACAGGAAGCCGTGGGAGAGAGAGCCGGCAGGTGGACCATC 
CTGGTTTCCCCACACACACCATTGTCCCCCTGGGAAACCTGTTGGTGAAGTTCTAGATGT 
CTTATCCAAGAAGGGTCCTCTTGAGGTCATCTCAGCTATCCCCCTGCCTCTAGGCAAGCT 
GTTTTCTGTTTCTTCCAAGCTGACTGGCTGAATGGTAGGAGCCTTTCTGCCAGGGAAACT 
[A, G] 

AGGTCTGGGAAGGGAGTATGGCTTGTGGGGACACCAGGGGTCAGGGGAGGGGAGGGTCCA 
CCTGCTGAATCAAGTGGGGCCTCCTGCCCTCGTGATTCCCCTTTGCCTGGTGCTCAGTGG 
GGGTGATGGTGACGCCACAGGTGTGGAGTGCCAGCCACGTGCTGAGCGCCAAGCAAAACA 
GCCAGGGTGAGTCTATGCATCATCAGTGCCTGGGAAGGAAGGCCACTGCGAGCAGGGAGT 
CTGACGGAAAAACTTGACAGAGGGAAGGGAGGCACCTTGCTTTATCGGGGCGGGGAAGGC 

2487 ACACGGCTTCTGCACTGGTATCCCTAAGATGGGGTTAAGGGAAGCCCTGGGGAAGTGAGG 
TTCTGAATGATGAATTTAAGATCCTACAACCTCATCTGTACTGAGACCCCCAGGGAGGAT 
GGGGAGCAGGAGCAAGAACCATCCAGAAGGGTTATATGGCATTCCCAAACCCCTGCATGG 
P=s CATCTCCCATATTCTCAATTCACCCGGGTCTCTCTGGGTTTGTTAAGGCATGGTAGATGA 
1? GCATCTACGTTATGGAGGGGTGGGGAGCATCAGAGCCCTTACTCCATGCCCTGTTCCCTC 
'*H [C,A] 

"N TTACAAAAAATACCTGAAGTTACCATCACCCCAGGTTCTTTGTCCTTTCCCTCCCGGATG 
=p TTCCTTCCTCCACTTGGTCCAGAGAATGCCAAAAGGAGGCCCTAAATTTCTGAACTTTCC 
j=| TGAGGGGACCTACCAGGGTGTAGTCCTACCAGCGCCCAGGGTCTTTCCACTCTCATCTCC 
5 a CTGGAAATGCGATGGTGGGTATGAAACCTTGTCCCTAAGTAGGCGCTACACAAGGTGATC 
^2 CATACCCACACCCCAGGAGGCTGGGGCTGCGGGTGTCACCCTCCCCATTCCCAGACTCCT 

03 2634 AGGGTTATATGGCATTCCCAAACCCCTGCATGGCATCTCCCATATTCTCAATTCACCCGG 

GTCTCTCTGGGTTTGTTAAGGCATGGTAGATGAGCATCTACGTTATGGAGGGGTGGGGAG 
5 " s CATCAGAGCCCTTACTCCATGCCCTGTTCCCTCCTTACAAAAAATACCTGAAGTTACCAT 
l! CACCCCAGGTTCTTTGTCCTTTCCCTCCCGGATGTTCCTTCCTCCACTTGGTCCAGAGAA 
TGCCAAAAGGAGGCCCTAAATTTCTGAACTTTCCTGAGGGGACCTACCAGGGTGTAGTCC 

iy [t,c] 

g3 ACCAGCGCCCAGGGTCTTTCCACTCTCATCTCCCTGGAAATGCGATGGTGGGTATGAAAC 
f4 CTTGTCCCTAAGTAGGCGCTACACAAGGTGATCCATACCCACACCCCAGGAGGCTGGGGC 
:f TGCGGGTGTCACCCTCCCCATTCCCAGACTCCTGGCAGACCTCCTCTGGCCCAGCTATAG 
^ GCCAACTCACTCTCCCTCACTCCCTTGGGGAAACGGCTGATTCAGTTACCTGGATTGAGG 
TCACTGGCAATGGCTGAAGTGGAGACGCAGGTGGAACTGGTTCAGGCCGGGGGAATCACC 

4352 ATTGGAGTTACCACACATAAAGGATAGTGAGTCAGCAGAGTGCACCCTGCAGGAACAATA 
GAGCCTTCCTTTTCAAGGAAGTTCTAAGAAAAATGGCAGCAGGCAGGCCCCACTCGGGTG 
TATTCACTCATTCATTTATTCAACAAATATTTACTAAGTGCCCCTGTGCAAGGCTCGAGG 
TGTACAAAGATGAACAGGAGAGCTAGACTTCTTGCCATGCGTGGTGGGGTTTGCTGCCTA 
GTGGGAGAGACAGACAAAAAGCAAGGAATGCACACACAGGATGCACACACAGCGGCAGGA 
[A, G] 

CCAAGGTGCAGTTACCCAGGCCTGGGATCAGACAGACAGGACTCAGAGGAGACTTTCCCA 
GAGAAAAGCCATCTGAGCCAAGGGATGGATCTGATACCTCCGAAGGCTGAGCCACCATAA 
CACTCATACCTTTAAGCCAAGTCTTATAAACTCCCCAGGTAAGCAGCTGGCAGTCAGAAG 
ACCTCCAGCTAATGCCCAGGACAAGTTGATGAGCTCTCAAGAAAAAGTTCCTGCCTTTTC 
TTCTCAATATCCCTGGCACACAGTTCAGTGAATTTTGAATGAACCAATGAATGAAATGAG 

5157 ATCCAGGTCCCACAAGGTGAAGGGGCTCCTTCAGCCAGGCCTGGATTGCCACTCCCCTCA 
CCATTCCTCTCCTCATCCCCACTCCATCCCTCTGTGATCCCCATAAGCTAGTCATGCTGC 
TGAGCTTCAGTCTCGTTGTCCTCTGCAGGCATGGCATTTGCTCTGCTGGCCAACCTTCCT 
GCAGTCAATGGCCTCTACTCCTCCTTCTTCCCCCTCCTGACCTACTTCTTCCTGGGGGGT 
GTTCACCAGATGGTGCCAGGTAAGGCCTCTCCCCTCTGGGCAGGCAGGATGACCCAGACC 
[A,C] 
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CAAGGATGGGAGGTGTGGCAAAGGGGCCTCGGGAGATTTTCCATCTGCATTCTCCTGGAG 
TTGTTCCTGGTCAGTCCTAGGGGAATGGTCACTGTGAATGTCATTTCCAGGTCCTCGGTG 
ACCTTGGAGAAACCACTGAGCCTCTTTGAGTTCAGTTAGCATTACCTGTTCCATCTTCCT 
CCTAGGAATGAGAGGAAGACTTAGCAGAACAAGATATACCATATGCTATAACATGCTTAA 
ACAGATGTGAGAAATCACCATCTAACTCCCTGGTTGGTCCCAGCCGGCCACTACAGGGAC 

5658 TTAGCAGAACAAGATATACCAT ATGCTATAACATGCTTAAACAGATGTGAGAAATCACCA 

TCTAACTCCCTGGTTGGTCCCAGCCGGCCACTACAGGGACATTTGGACTTCTCTGGTGCT 
AAGTGAGATGGAGGAAAGCCTGGTCACAAGGGCTGGTTTCTGGTTCAGGCTCTGCTTATA 
TTTCTTATTTCTGAGTTCATTTTCTCACGTGTCCTGTATGACAATATTGACCATTGGGGT 
AAAAGCACCTTGAAAAGCATAGATCATGGTTAGAGTGAGTGGTTGTTATTATTGTGTTGG 
[A,T] 

GAAGAGCCTTGGAGGTGCAGGGATCCATCCCCCTGGGGTCGGGAAGCATTCCTGGGCCCC 
TTTCTGGTTTCCATCGGTGTGGTTCAAACCTCTGATTTTTGCTGGCTGGGTGGGGCACCA 
CAGGTACCTTTGCCGTTATCAGCATCCTGGTGGGTAACATCTGTCTGCAGCTGGCCCCAG 
AGTCGAAATTCCAGGTCTTCT^ACAATGCCACCAATGAGAGCTATGTGGACACAGCAGCCA 
TGGAGGCTGAGAGGCTGCACGTGTCAGCTACGCTAGCCTGCCTCACTGCCATCATCCAGG 

5945 ATTATTGTGTTGGAGAAGAGCCTTGGAGGTGCAGGGATCCATCCCCCTGGGGTCGGGAAG 
CATTCCTGGGCCCCTTTCTGGTTTCCATCGGTGTGGTTCAAACCTCTGATTTTTGCTGGC 
TGGGTGGGGCACCACAGGTACCTTTGCCGTTATCAGCATCCTGGTGGGTAACATCTGTCT 
GCAGCTGGCCCCAGAGTCGAAATTCCAGGTCTTCAACAATGCCACCAATGAGAGCTATGT 
GGACACAGCAGCCATGGAGGCTGAGAGGCTGCACGTGTCAGCTACGCTAGCCTGCCTCAC 
[T,C] 

GCCATCATCCAGGTGAGGGGGCAGCCCCCAACCCTGCTAGAAGGGCATCAGACCACCCTG 
CCCCTCCCTCAAAGCCTTAGCTTTGATGCTAAATCTGATTTAGGGGGCTGGGTGTGGAGG 
^ CTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGAGGGTGGATCACTTGAGGTCAGG 
~~~4 AGTTTGAGACCACCTTGACCAACGTGATGAAACCCCATCTCTACCAAAAATACAAAAATA 
=p ATCCAGGCTTGGTAGTATGCGCCTGTAGTCCCACCTACTCAGGAGGCTGAGGCAGGAGAA 

{Z 6281 GCTAGAAGGGCATCAGACCACCCTGCCCCTCCCTCAAAGCCTTAGCTTTGATGCTAAATC 

TGATTTAGGGGGCTGGGTGTGGAGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGA 
W GGAGGGTGGATCACTTGAGGTCAGGAGTTTGAGACCACCTTGACCAACGTGATGAAACCC 
Jj CATCTCTACCAAAAATACAAAAATAATCCAGGCTTGGTAGTATGCGCCTGTAGTCCCACC 
= TACTCAGGAGGCTGAGGCAGGAGAATCACTTGAATCCGGGAGGCAGAGGTTGCAGTGAGC 

U [C ' T] 

!_ s GAGATCGCGCCACTGCACTCCAGCCTGGGTGACAGAGCGAGACTCCGTCTCAAAAAAAAA 
J AAAAAAAAAAAAAAAAAAACCCAAGTTAGGGCTCACCTCCTCCCTCCTCCCCATCCCAGG 
!=j GCTAAAGTGAACCTTGAAAATTAACAGTATCTCCTCATCTGCATGTAGCAGGACCATACA 
Ql AAAAAACAACAGCTGTACCTGGTTAAACTGTCCTGAGCTTTAAACCTGTAAAAGACTCAC 
AGCCTCTCTCCATTATCCCGTGGAGAAACCCAACTCTCTGCCAGCATAGTCTTGCAGACT 

^ 64 52 ATGAAACCCCATCTCTACCAAAAATACAAAAATAATCCAGGCTTGGTAGTATGCGCCTGT 

AGTCCCACCTACTCAGGAGGCTGAGGCAGGAGAATCACTTGAATCCGGGAGGCAGAGGTT 
GCAGTGAGCTGAGATCGCGCCACTGCACTCCAGCCTGGGTGACAGAGCGAGACTCCGTCT 
CAAAAAAAAAAAAAAAAAAAAAAAAAAAACCCAAGTTAGGGCTCACCTCCTCCCTCCTCC 
CCATCCCAGGGCTAAAGTGAACCTTGAAAATTAACAGTATCTCCTCATCTGCATGTAGCA 
[G,C] 

GACCATACAAAAAAACAACAGCTGTACCTGGTTAAACTGTCCTGAGCTTTAAACCTGTAA 
AAGACTCACAGCCTCTCTCCATTATCCCGTGGAGAAACCCAACTCTCTGCCAGCATAGTC 
TTGCAGACTGCTAATTTTCTCTAACATCCCTCACTCCGCTCCAGCCTCCTCTGCTCCAAG 
CCACAGCAGCAGTTGCACAACATAAATTGAGCTTCTGCAAATGGTTGCAAAGGATTCTGC 
TAGGTTTTATGAAGGGAAGCACAACATGACAGAATGCAAGAGCAAAACACAGTCCCAGAG 

6610 GTGACAGAGCGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAACCCAAGTTA 
GGGCTCACCTCCTCCCTCCTCCCCATCCCAGGGCTAAAGTGAACCTTGAAAATTAACAGT 
ATCTCCTCATCTGCATGTAGCAGGACCATACAAAAAAACAACAGCTGTACCTGGTTAAAC 
TGTCCTGAGCTTTAAACCTGTAAAAGACTCACAGCCTCTCTCCATTATCCCGTGGAGAAA 
CCCAACTCTCTGCCAGCATAGTCTTGCAGACTGCTAATTTTCTCTAACATCCCTCACTCC 
[T,G] 

CTCCAGCCTCCTCTGCTCCAAGCCACAGCAGCAGTTGCACAACATAAATTGAGCTTCTGC 
AAATGGTTGCAAAGGATTCTGCTAGGTTTTATGAAGGGAAGCACAACATGACAGAATGCA 
AGAGCAAAACACAGTCCCAGAGAGCGCCTTTTCATTCACTCATTCATTCGGTTTTGTGCC 
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AAGAACTAGGCTAAACCCTGGGATACAAAGATAAGTAAGAAAGAGGTCCAATTCACAAGT 
TGCTCACAGCCCAGCAGAGGAAGGAGCCATGTCAACAGATAAATTTGTATGCAGTGAGAT 

7247 GACACAGAGCAGAGTCACGGAGGACCTCAAAGAGGAGGTGACACTCCACCTCTCTTAAAG 
GATGAGAACTTAACCAGGAACAAGGTATACAGAGGATGGTCCAGGCAGAAGGGAACAGTG 
CCTAAAAACACTGAGGCCTGAGAGAGTGTGATCTGCGCAGGCAAAGTAAGGGGCTTGGTG 
TGGCTGGAGGGTAGAGGGCCCAGAAGAGGATGGAAAAGTAGGCAGGAGCCAGACAATGAG 
ATCTGGGGTCTGTTCTCTGACAGCGACTTTGGGTCTGATTGGCAGTTTATAAGGATCGTT 
[T,C] 

GGGCTACACAATGATGAGTGGGAGGTGGATTAGAATCAAGGCAGGGGACCTGTTGGGAGA 
CTCTGCAGAGGCCCAGGCAGGAATAATGCAGGCGAAGACCAGGTAGAGAAAGAGATGGGG 
CTGGACTTGAAAAGAATGTTTTACCAGGAGCTTGGTGATAGACTGGATGTGGGAGGTAAG 
GGAGGATGACTCTCAAGTTTTTGGTTGGGCAACCAGGTTAATGATGGTGTCATTTACTGA 
GAGAGAAAACACTGGGGGAGGACTAGACTTATTTTACAGATAAGCCAAAGCCAGAGAGGT 

7360 AACAGTGCCTAAAAACACTGAGGCCTGAGAGAGTGTGATCTGCGCAGGCAAAGTAAGGGG 
CTTGGTGTGGCTGGAGGGTAGAGGGCCCAGAAGAGGATGGAAAAGTAGGCAGGAGCCAGA 
CAATGAGATCTGGGGTCTGTTCTCTGACAGCGACTTTGGGTCTGATTGGCAGTTTATAAG 
GATCGTTTGGGCTACACAATGATGAGTGGGAGGTGGATTAGAATCAAGGCAGGGGACCTG 
TTGGGAGACTCTGCAGAGGCCCAGGCAGGAATAATGCAGGCGAAGACCAGGTAGAGAAAG 
[A,G] 

GATGGGGCTGGACTTGAAAAGAATGTTTTACCAGGAGCTTGGTGATAGACTGGATGTGGG 
AGGTAAGGGAGGATGACTCTCAAGTTTTTGGTTGGGCAACCAGGTTAATGATGGTGTCAT 
TTACTGAGAGAGAAAACACTGGGGGAGGACTAGACTTATTTTACAGATAAGCCAAAGCCA 
GAGAGGTGATGTGACAGAAAGGCCCATGCTCTAAAGGAGCTGAAGGTCTGATGGCAGCCA 
TGTAGAGCACAGTGAAGGGCAGGTGAAGGTCACAGATGGTCCAATTCCCTCAAGCTACTG 

7 64 4 GACCAGGTAGAGAAAGAGATGGGGCTGGACTTGAAAAGAATGTTTTACCAGGAGCTTGGT 
GATAGACTGGATGTGGGAGGTAAGGGAGGATGACTCTCAAGTTTTTGGTTGGGCAACCAG 
GTTAATGATGGTGTCATTTACTGAGAGAGAAAACACTGGGGGAGGACTAGACTTATTTTA 
CAGATAAGCCAAAGCCAGAGAGGTGATGTGACAGAAAGGCCCATGCTCTAAAGGAGCTGA 
AGGTCTGATGGCAGCCATGTAGAGCACAGTGAAGGGCAGGTGAAGGTCACAGATGGTCCA 
[A, T] 

TTCCCTCAAGCTACTGCTACGCTAGGACTGCACGGAGCTCCAGACCTGCGTGTGTGTGGG 
GCGGGTCGTTGGAACTGCTGAACCACATTGGTCTTCCGCCACCAACCACCCTTTTCCTCC 
TCTCAGATGGGTCTGGGCTTCATGCAGTTTGGCTTTGTGGCCATCTACCTCTCCGAGTCC 
TTCATCCGGGGCTTCATGACGGCCGCCGGCCTGCAGATCCTGATTTCGGTGCTCAAGTAC 
ATCTTCGGACTGACCATCCCCTCCTACACAGGCCCAGGGTCCATCGTCTTTGTGAGTCTG 

8127 CATCCGGGGCTTCATGACGGCCGCCGGCCTGCAGATCCTGATTTCGGTGCTCAAGTACAT 
CTTCGGACTGACCATCCCCTCCTACACAGGCCCAGGGTCCATCGTCTTTGTGAGTCTGGG 
GATGCACCCCTGCCATTGGAGCAAGGCTCCAGCAGACACATGAGGAGGATGTACTGTTTT 
AAGATGTCGTGAGCTCCTCATTGCAAGGGCTGGCTTAGCTGTTGTTCAGAGAGGATTCTG 
AGGGGGTTTCTGTCTTGGGAGGGTCAAAGTCATGACTCACAGAGGTTCTTGGTAGTTAAT 
[A,C] 

CCTGCAGAAAAGAGCTGTACATTCTCCGCCAGTTCCCCATTCTAGTGCCTCAACCCCTCC 
CTGCCTGGAAAGTCCTGCCTTATGTCTAATCTCCATCCCTCCTCCTTCAGCCCAAACTCT 
TCTAAAGAAAAAGAAAGCATTCCTTTTCTAGCACAAGTTCCCCATGTGCCTTTTGGGAAA 
GGGCGGTGGGCGACGGGACAGGGTTCCTGATCAGGGTTTTAATTCTGTCTTGGTGTGCCT 
CCATTAGCTTTGATGGCATCCCTTCCCTGGGTCAGACACCCAAAGGTGGGGTATTATGGG 

8317 GAGCTCCTCATTGCAAGGGCTGGCTTAGCTGTTGTTCAGAGAGGATTCTGAGGGGGTTTC 
TGTCTTGGGAGGGTCAAAGTCATGACTCACAGAGGTTCTTGGTAGTTAATACCTGCAGAA 
AAGAGCTGTACATTCTCCGCCAGTTCCCCATTCTAGTGCCTCAACCCCTCCCTGCCTGGA 
AAGTCCTGCCTTATGTCTAATCTCCATCCCTCCTCCTTCAGCCCAAACTCTTCTAAAGAA 
AAAGAAAGCATTCCTTTTCTAGCACAAGTTCCCCATGTGCCTTTTGGGAAAGGGCGGTGG 
[G,A] 

CGACGGGACAGGGTTCCTGATCAGGGTTTTAATTCTGTCTTGGTGTGCCTCCATTAGCTT 
TGATGGCATCCCTTCCCTGGGTCAGACACCCAAAGGTGGGGTATTATGGGAAGAAGGGGT 
GGGAGCCTGTGAGCATGATGCTCTTTCCCCCAGACCTTCATTGACATTTGCAAAAACCTC 
CCCCACACCAACATCGCCTCGCTCATCTTCGCTCTCATCAGCGGTGCCTTCCTGGTGCTG 
GTGAAGGAGCTCAATGCTCGCTACATGCACAAGATTCGCTTCCCCATCCCTACAGAGATG 
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TTCTACTGCTCTAATAATTCCCCCTAAGGAGGCAGGGGAGTGGGATTCAGGGTCCCCAGA 
GAAAAGGGAGACTTGAGAGAGACGCCTGCCCTGGCCCCACCTTAGGGCCAATCCCCATTC 
TCCACTCTGGGGTTTGCAGGTGGTGGTGGCAACAGCTATCTCCGGGGGCTGTAAGATGCC 
CAAAAAGTATCACATGCAGATCGTGGGAGAAATCCAACGCGGGTGAGTCCAGGTGGCCCA 
GAAGCCTGGCCCACCCGCACCTCATGCCCCACTAAGGCCTGAGCTCGGAGAGGGAGACAA 
[G, A] 

ATGAACTCTATGAAAGTGCAGTCGAAACTGTATGACACTGACCATGTATGAATTATTACT 
ATTACCGTTTCCTGAGAAGGGCCGCACAACCAGCCAATGTAGGCTATTTTATGAGAAATG 
AGTCTTAACTGCCACACTCCCCTTATAAATCTCATTCAACTGATGCTGTTAAACAAAGCC 
TCTCTG71ACAGCCGCTTGCTGGCTCTTTGCCTTGCTCTAATGCATTGGTTCTTTGTCCAT 
GTAGAAAGGGAACTATTAGGTTCAACCAGATTCATGAAGCATCCACTCTGTGCCAGGCAC 

AACTGATGCTGTTAAACAAAGCCTCTCTGAACAGCCGCTTGCTGGCTCTTTGCCTTGCTC 
TAATGCATTGGTTCTTTGTCCATGTAGAAAGGGAACTATTAGGTTCAACCAGATTCATGA 
AGCATCCACTCTGTGCCAGGCACCATGCTGGGCCCTGGGAGGAGAGGGGTGACGCTTGTC 
CTGCAGGGTTGGAACAGGCAAGGGAGGGAAGACCACATAGCACCAAAGGTCTAGGGGTCT 
GTGGACTCGTGAGCATACAGGGTTCAGAATCTGGGAGTTAACAAACGAGGCCCTACCACA 
[G, T] 

ACTGGCCCGGGGACCTTGGGCAAGTTAGGTTCTCTCAGCCTCAGTTTCCTCCTTTGTAAA 
ACAGGAGTGATGGTCCCTACCCTATGGGGTGGTGCTGAGGATTCAGACTGGATGGGATAA 
CTTAGGCAAAGATCCCGGCACACCATGGGGGCCTGGCTGGTCCCTGTGGGCTGGTGAAGG 
ACTTGGCTGCCCTCCCCACTCACACCCTTGGGTTCTGCCTCCTTCCTGGCTCCTCGGCAG 
GTTCCCCACCCCGGTGTCGCCTGTGGTCTCACAGTGGAAGGACATGATAGGCACAGCCTT 

AGCCCCACCATAACCTATGGGAGAGGATTTACTAAACTTTCTTAACGGTGATGAAACCAA 
GGCTCAGAATGGTTAAGTAAATTGTCAAAGGCCACAGAGGTAGGGAGTGGTAGAGTCTGG 
ATTAAAACTCCAAGTCCTGGACTCCAGACCTCTAGGCTGTACTGTCTCATAGGGAAGGCA 
GTCTCACCCACCTAGGGCAGAGAAGAAAATCCTTAAAGCCAGAGAAGTGAGTGGCTCATC 
TGTGGTCACCCAGAGAGACAGTGATGAGGACAGGGAGAAAAATTATACCTCAGTTCCCAG 
[C,G] 

CCAAGGATCTGCTTTGACCATAACCCAACAAGCCCCCGCTATGGTGGTATTTCCTTAGGT 
TCATATGGCGGCTTTTGTTTCCATTTGATCTTCACAGCAATTCTCTACAGGAATCTGGGC 
AGATTTATTTCCTTTAGAGGAATTTCCAGGTCTTAAAATCTATAGGGGGCAACTATCAAA 
ACTTCACCCAATGTTGCCCCCTACCCACACACAAAACCAGGCCCCCAGCCGATCAGAAAG 
CACTGCTGAGCTCCTGTCAGGGCCCACGCAGCTCGCTGTGAGACAGAGAGAGGGAACTCA 

GAAACCAAGGCTCAGAATGGTTAAGTAAATTGTCAAAGGCCACAGAGGTAGGGAGTGGTA 
GAGTCTGGATTAAAACTCCAAGTCCTGGACTCCAGACCTCTAGGCTGTACTGTCTCATAG 
GGAAGGCAGTCTCACCCACCTAGGGCAGAGAAGAAAATCCTTAAAGCCAGAGAAGTGAGT 
GGCTCATCTGTGGTCACCCAGAGAGACAGTGATGAGGACAGGGAGAAAAATTATACCTCA 
GTTCCCAGCCCAAGGATCTGCTTTGACCATAACCCAACAAGCCCCCGCTATGGTGGTATT 
[C,T] 

CCTTAGGTTCATATGGCGGCTTTTGTTTCCATTTGATCTTCACAGCAATTCTCTACAGGA 
ATCTGGGCAGATTTATTTCCTTTAGAGGAATTTCCAGGTCTTAAAATCTATAGGGGGCAA 
CTATCAAAACTTCACCCAATGTTGCCCCCTACCCACACACAAAACCAGGCCCCCAGCCGA 
TCAGAAAGCACTGCTGAGCTCCTGTCAGGGCCCACGCAGCTCGCTGTGAGACAGAGAGAG 
GGAACTCACATTTATTGATCACCTACTGAGCATCCATCACTAGGCTAGGACCGTCACATT 

ACCCACCTAGGGCAGAGAAGAAAATCCTTAAAGCCAGAGAAGTGAGTGGCTCATCTGTGG 
TCACCCAGAGAGACAGTGATGAGGACAGGGAGAAAAATTATACCTCAGTTCCCAGCCCAA 
GGATCTGCTTTGACCATAACCCAACAAGCCCCCGCTATGGTGGTATTTCCTTAGGTTCAT 
ATGGCGGCTTTTGTTTCCATTTGATCTTCACAGCAATTCTCTACAGGAATCTGGGCAGAT 
TTATTTCCTTTAGAGGAATTTCCAGGTCTTAAAATCTATAGGGGGCAACTATCAAAACTT 
[C,T] 

ACCCAATGTTGCCCCCTACCCACACACAAAACCAGGCCCCCAGCCGATCAGAAAGCACTG 
CTGAGCTCCTGTCAGGGCCCACGCAGCTCGCTGTGAGACAGAGAGAGGGAACTCACATTT 
ATTGATCACCTACTGAGCATCCATCACTAGGCTAGGACCGTCACATTCCTTAACTTTTGA 
ATCCTTTCATGAGGTAGGCATTATTATTCTCCTTTTGTTTCACATAGCCATTAAAGAACA 
AAATTTGGGGCTGGGTGTGCTGACTCACACCTGTGATCTAGCACTTTAGGGGGCTGAGGC 

CTAACTATTCAGGAAGGTTAGGCGGGAGCACAACTTGGGTTCCAGGGTTTGAGGCTCCAG 
TGAGCTGATCTTGCCACTGCACTACAGCCTGAGCAACAGAGCAAGACCCTGTGACTCCAA 
AAACAAACAAACAAACACATTTTGAACCCAAACAGATCTGACCCAAGATGCATGCTCTTA 




TAGATGCCACCTCCCTGTGTGCTGGGGCTTCTACTAAAAACACAGACAAGATCAGGCAAC 
CACAGTCAATCTAAGGGAAAGAGGAAAGTGTAACCAAAGCACAAATACATAAAATATTGC 
[-,A] 

AAAATGCTATTTAAAGAAAAAAAAGAGAAGAGAGGCTCTGAGGTTGTACTAACAGAGAAT 
GGCCTTGGCTAATCCAGGAAGACTTCCTGAAAGAGGTTGTTTTTTCCCCAGGTCTGCTTT 
TGACATCTCTCTTTTCACAGTGCATCTGGGTAGTGAGCTTCCTCTCCTCCTTCTTCCTCA 
GCCTGCCCTATGGTGTGGCAGTGGGTGTCGCCTTCTCCGTCCTGGTCGTGGTCTTCCAGA 
CTCAGTTGTAAGTGATAGCTTCCGCCCTCCTAGGCCCACAGTCGGTTCCCTGGGCCAGCC 

13257 GTGAGCTGATCTTGCCACTGCACTACAGCCTGAGCAACAGAGCAAGACCCTGTGACTCCA 
AAAACAAACAAACAAACACATTTTGAACCCAAACAGATCTGACCCAAGATGCATGCTCTT 
ATAGATGCCACCTCCCTGTGTGCTGGGGCTTCTACTAAAAACACAGACAAGATCAGGCAA 
CCACAGTCAATCTAAGGGAAAGAGGAAAGTGTAACCAAAGCACAAATACATAAAATATTG 
CAAAAATGCTATTTAAAGAAAAAAAAGAGAAGAGAGGCTCTGAGGTTGTACTAACAGAGA 
[A, G] 

TGGCCTTGGCTAATCCAGGAAGACTTCCTGAAAGAGGTTGTTTTTTCCCCAGGTCTGCTT 
TTGACATCTCTCTTTTCACAGTGCATCTGGGTAGTGAGCTTCCTCTCCTCCTTCTTCCTC 
AGCCTGCCCTATGGTGTGGCAGTGGGTGTCGCCTTCTCCGTCCTGGTCGTGGTCTTCCAG 
ACTCAGTTGTAAGTGATAGCTTCCGCCCTCCTAGGCCCACAGTCGGTTCCCTGGGCCAGC 
CCGCAAAGGGCTTCCATGCCACGGCCTGGCTTAGTCCACTGTACCTTCCACCTCTGGGCC 

14541 TCATGGACACTGACATTTATGTGAATCCCAAGACCTATAATAGGGTAGGTAATTCAAGCT 
TATGACCTCCTTCTTTTGCTCTGCACCACCCCAAGAAGAGGTTGCTTTTTAAAGCCAATA 
AAGACATTTCTGCAACTTGAGCTCAGTCTCCCTGTCACAGGCCCAGGATATCCAGGGGAT 
TAAAATCATCACGTACTGCTCCCCTCTCTACTTTGCCAACTCAGAGATCTTCAGGCAAAA 
GGTCATCGCCAAGGTAAGGCTCAGTCCCTGGCGACCAGAGGCTCTGGACAGAGAGTGGCC 
[G,A] 

GAAAATGGAAGCAGAAGGGCGGTGGGAGCTGAGAATAGGCCACTCCCATAGAGGGTGGAG 
GTCAAGATTGCTGTTGGCTCTCTCCCTGCAGAGAGGCATGGACCCCCAGAAAGTATTACT 
AGCCAAGCAAAAATACCTCAAGAAGCAGGAGAAGCGGAGAATGAGGCCCACACAACAGAG 
GAGGTCTCTATTCATGAAAACCAAGGTGAATGAAGGCCAGAAGCAGCCCCGTGCCCTGCT 
CTCCTGCCCATTCTGATACTGCCCCCTGTTACTCATGGTACCCTGGGGGCCCCGCTTCCC 

14545 GGACACTGACATTTATGTGAATCCCAAGACCTATAATAGGGTAGGTAATTCAAGCTTATG 
ACCTCCTTCTTTTGCTCTGCACCACCCCAAGAAGAGGTTGCTTTTTAAAGCCAATAAAGA 
CATTTCTGCAACTTGAGCTCAGTCTCCCTGTCACAGGCCCAGGATATCCAGGGGATTAAA 
ATCATCACGTACTGCTCCCCTCTCTACTTTGCCAACTCAGAGATCTTCAGGCAAAAGGTC 
ATCGCCAAGGTAAGGCTCAGTCCCTGGCGACCAGAGGCTCTGGACAGAGAGTGGCCGGAA 
[A,G] 

ATGGAAGCAGAAGGGCGGTGGGAGCTGAGAATAGGCCACTCCCATAGAGGGTGGAGGTCA 
AGATTGCTGTTGGCTCTCTCCCTGCAGACAGGCATGGACCCCCAGAAAGTATTACTAGCC 
AAGCAAAAATACCTCAAGAAGCAGGAGAAGCGGAGAATGAGGCCCACACAACAGAGGAGG 
TCTCTATTCATGAAAACCAAGGTGAATGAAGGCCAGAAGCAGCCCCGTGCCCTGCTCTCC 
TGCCCATTCTGATACTGCCCCCTGTTACTCATGGTACCCTGGGGGCCCCGCTTCCCACCC 

15041 ACCAAGGTGAATGAAGGCCAGAAGCAGCCCCGTGCCCTGCTCTCCTGCCCATTCTGATAC 
TGCCCCCTGTTACTCATGGTACCCTGGGGGCCCCGCTTCCCACCCTGACAGGCAAAGACA 
GAAAGTCTCTGGGAACACTGCCTGGTGGCCGCTGGGCATTTTTCTTCTTTTTTTTCTTTT 
TCTTTTTAGAGATGGAATTTTGCTCTTGTCACCCAGGCTTGAGTGCAATGGCGTTATCTT 
GGCTCACTGCAACCTCCACCTCTGGGGTTCAAGCGATTCTCCTGCCTTAGCCTCCCAAGT 
[C,A] 

GCTGAGATTACAGGTGCCACCACACCCAGCTAATTTTTGTATTTTTAGTAGATATTGGGT 
TTCACCATGTTGGCCAGGCTGGTGTCAAACTCCTGACCTCAGGTGATCCACCTACCTTAG 
CCTTCCAAAGTGCTGGGATTACAAGCCTGAGCCACTGCGCCCAGCCTGGGCATTTTTCTT 
CTTGGATGAGGTGCTACCATCTCCCAGGGAAGCCACTGAACCCCCAAGGCCCTTCTCCAT 
TTTCTGGCTAAGATAGGACATGGCCCATGGACTTTTGAACAACCCAGAGGGGGAACAGCA 

15053 GAAGGCCAGAAGCAGCCCCGTGCCCTGCTCTCCTGCCCATTCTGATACTGCCCCCTGTT A 

CTCATGGTACCCTGGGGGCCCCGCTTCCCACCCTGACAGGCAAAGACAGAAAGTCTCTGG 
GAACACTGCCTGGTGGCCGCTGGGCATTTTTCTTCTTTTTTTTCTTTTTCTTTTTAGAGA 
TGGAATTTTGCTCTTGTCACCCAGGCTTGAGTGCAATGGCGTTATCTTGGCTCACTGCAA 
CCTCCACCTCTGGGGTTCAAGCGATTCTCCTGCCTTAGCCTCCCAAGTCGCTGAGATTAC 
[A,C] 
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GGTGCCACCACACCCAGCTAATTTTTGTATTTTTAGTAGATATTGGGTTTCACCATGTTG 
GCCAGGCTGGTGTCAAACTCCTGACCTCAGGTGATCCACCTACCTTAGCCTTCCAAAGTG 
CTGGGATTACAAGCCTGAGCCACTGCGCCCAGCCTGGGCATTTTTCTTCTTGGATGAGGT 
GCTACCATCTCCCAGGGAAGCCACTGAACCCCCAAGGCCCTTCTCCATTTTCTGGCTAAG 
ATAGGACATGGCCCATGGACTTTTGAACAACCCAGAGGGGGAACAGCAGTGAATTTCCTG 

15065 CAGCCCCGTGCCCTGCTCTCCTGCCCATTCTGATACTGCCCCCTGTTACTCATGGTACCC 
TGGGGGCCCCGCTTCCCACCCTGACAGGCAAAGACAGAAAGTCTCTGGGAACACTGCCTG 
GTGGCCGCTGGGCATTTTTCTTCTTTTTTTTCTTTTTCTTTTTAGAGATGGAATTTTGCT 
CTTGTCACCCAGGCTTGAGTGCAATGGCGTTATCTTGGCTCACTGCAACCTCCACCTCTG 
GGGTTCAAGCGATTCTCCTGCCTTAGCCTCCCAAGTCGCTGAGATTACAGGTGCCACCAC 
[A, G] 

CCCAGCTAATTTTTGTATTTTTAGTAGATATTGGGTTTCACCATGTTGGCCAGGCTGGTG 
TCAAACTCCTGACCTCAGGTGATCCACCTACCTTAGCCTTCCAAAGTGCTGGGATTACAA 
GCCTGAGCCACTGCGCCCAGCCTGGGCATTTTTCTTCTTGGATGAGGTGCTACCATCTCC 
CAGGGAAGCCACTGAACCCCCAAGGCCCTTCTCCATTTTCTGGCTAAGATAGGACATGGC 
CCATGGACTTTTGAACAACCCAGAGGGGGAACAGCAGTGAATTTCCTGGGGAACCCAGGC 

15108 TGTTACTCATGGTACCCTGGGGGCCCCGCTTCCCACCCTGACAGGCAAAGACAGAAAGTC 
TCTGGGAACACTGCCTGGTGGCCGCTGGGCATTTTTCTTCTTTTTTTTCTTTTTCTTTTT 
AGAGATGGAATTTTGCTCTTGTCACCCAGGCTTGAGTGCAATGGCGTTATCTTGGCTCAC 
TGCAACCTCCACCTCTGGGGTTCAAGCGATTCTCCTGCCTTAGCCTCCCAAGTCGCTGAG 
ATTACAGGTGCCACCACACCCAGCTAATTTTTGTATTTTTAGTAGATATTGGGTTTCACC 
[A,C] 

TGTTGGCCAGGCTGGTGTCAAACTCCTGACCTCAGGTGATCCACCTACCTTAGCCTTCCA 
AAGTGCTGGGATTACAAGCCTGAGCCACTGCGCCCAGCCTGGGCATTTTTCTTCTTGGAT 
GAGGTGCTACCATCTCCCAGGGAAGCCACTGAACCCCCAAGGCCCTTCTCCATTTTCTGG 
CTAAGATAGGACATGGCCCATGGACTTTTGAACAACCCAGAGGGGGAACAGCAGTGAATT 
TCCTGGGGAACCCAGGCAGCCCAGGGCTAGCAAGGCTGGGGTGGCCATGGCAGTAATCCT 

16274 CTTCCAGACTGTCTCCCTGCAGGAGCTGCAGCAGGACTTTGAGAATGCGCCCCCCACCGA 
CCCCAACAACAACCAGACCCCGGCTAACGGCACCAGCGTGTCCTATATCACCTTCAGCCC 
TGACAGCT.CCTCACCTGCCCAGAGTGAGCCACCAGCCTCCGCTGAGGCCCCCGGCGAGCC 
CAGTGACATGCTGGCCAGCGTCCCACCCTTCGTCACCTTCCACACCCTCATCCTGGACAT 
GAGTGGAGTCAGCTTCGTGGACTTGATGGGCATCAAGGCCCTGGCCAAGGTGAGGCCCTC 
[-,G] 

GGGACAGCAAGCACCACCCACTCCACCCCCTCCGCTCTGCTCTCCACATTCCCTTTCCTG 
GGAGCCCTCATTTCAGGAAGCTGAGGGAGGAAGCTCACTGGGGAGACTAACAGCTCCTAG 
GAATCCCTCCTTTCCCCAGACGCCACCAGGTTGAGACATTCTCCACAGAGCAGGCCCAGA 
CGGCCCATGACAATGAGTGGCGGGACAAGTCTACCAGAGTTTCAGGCCCCTGTGCTCCCA 
ACACCCCCAGCAGTGGCCATCCCAAGTCCCTCTCAGCCATCAGGAACCCACCCAGGTTCT 

17424 AACATGGTGAAACCCCGCCTCTACTAAAAATACAAAAATTAGCCAGGTGTGGTGACGGGC 
CCCTGTAGTCCCAGCTACTCGGTAGGCTGAGGCAGAGAATTGCTTGAACCCAGGAGGCGG 
AGGTTGCAGTGAGCCAAGATCGCGCCACTGCACTCCAGCCTGGGCAACAGAGTGAAACTC 
CATCTCAAAAGAAAAAAAAAGAAAATATCTAGCCCCACAAGAAGGGGCCATGGTGACTTT 
AAGTGCCCGCCACGTTGGCAAAAGTCCATTTCCGCTCCACTTCCCAGAGAAACCGTCAGC 
[C,T] 

AACACTCCAGGGAGAAGTGGTGTGCTTTGCTGCTATTTTTGTCTTTGGCTGCTGGGCTCT 
CAGGGTTGCTTATTTGTTTGGCTTCCCCTCTGAAGTACGTTTTGTGAATCACTTTTGAGA 
CCCACTCAGAACATTCCTTTCCTTTTGCCTCCCTACCCCAACAACACTTCTAGCTGAGCT 
CCACCTATGGGAAGATCGGCGTGAAGGTCTTCTTGGTGAACATCCATGGTAAGAGAAAGA 
GGACATTTAGGGACTGAAAGACTGGCAAGGAGTGTGGGGTAGGAACAGGTTGGTGGGGTC 

17 627 AATATCTAGCCCCACAAGAAGGGGCCATGGTGACTTTAAGTGCCCGCCACGTTGGCAAAA 
GTCCATTTCCGCTCCACTTCCCAGAGAAACCGTCAGCCAACACTCCAGGGAGAAGTGGTG 
TGCTTTGCTGCTATTTTTGTCTTTGGCTGCTGGGCTCTCAGGGTTGCTTATTTGTTTGGC 
TTCCCCTCTGAAGTACGTTTTGTGAATCACTTTTGAGACCCACTCAGAACATTCCTTTCC 
TTTTGCCTCCCTACCCCAACAACACTTCTAGCTGAGCTCCACCTATGGGAAGATCGGCGT 
[G, A] 

AAGGTCTTCTTGGTGAACATCCATGGTAAGAGAAAGAGGACATTTAGGGACTGAAAGACT 
GGCAAGGAGTGTGGGGTAGGAACAGGTTGGTGGGGTCTGAATAGTGAGGAGGTTGGAAAC 
GAGAGCACCCAGCTATCCCCCACAAGCTGCTGCCTGCTCATAAAAGCTTCAGGTACAAGT 
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CCAAAGAGACTGGTCAGATTGCATAAACATCCTAGGGGCCTTAGTGACAGAGTGGGGGTG 
AGGAGGTCATGGAGTTACAGAAGGACAGCTAGGATTCTAATCTACCCC ATTRACT AATTTG 

18427 GGGTGCATATACACAGCCTCAAGGACGTGGCCACAGGGCAGCAGACATTTACATGACTAG 
CATGTACGCAAAGTGCAGAGATGTGGGAGCAAGTGCACACAGACACACAGGAGAATGTGA 
AGGGGCACATACACACACACCCAGCTCCCTGCACTGGGTCAGACCCCCTCCAGCAGGGCT 
GCAGTTCCCAAGCTCCGCATGGCCACGTTCGGGGAGAGAATCTGCAGTGGCAATGACCTG 
CTATGATATGTTCTGGAGTTAGAAGCAGTGGATTCTCCCCAACCTCACTGGACACCCCCT 
[T,C] 

AGGAAACCATCTCTAGGATTAAGAGTAATCCACACAAACTTCCAATGCCACACATTGGAA 
GTTGCTGGAAAGGTCTGGGAAAACAAGAGGAAGGATGGGTCCTTGGGGGATAGAACTGGC 
AGCGGCCTCTTCAAGGATGGCTTAGGCTTTTCCACTCGAATCACCACAAAGTACTGACTC 
CCTAAATCT^AACTGCTTCCTTCTGCTCTGGGTTGAAACTTCAGCATCCTCAAGTTCATGT 
TGCCCTCTGCCGTCCAGAACTGATATTGCACTGCCAATGCCATGGCCCTCAGATACAGCA 

18813 AGAGGAAGGATGGGTCCTTGGGGGATAGAACTGGCAGCGGCCTCTTCAAGGATGGCTTAG 
GCTTTTCCACTCGAATCACCACAAAGTACTGACTCCCTAAATCAAACTGCTTCCTTCTGC 
TCTGGGTTGAAACTTCAGCATCCTCAAGTTCATGTTGCCCTCTGCCGTCCAGAACTGATA 
TTGCACTGCCAATGCCATGGCCCTCAGATACAGCAAGAGCTGGGACCTCAGGCCTCTCCC 
ATCCCTGCTCTGGTCTCACTATCTTCCCCACCCCCAGCTCCAATCCACAATGGCTGTTAT 
[C,G] 

TTTCTGAAGGTGATCTTTTCTCCTTCTAGCCCAGGTGTACAATGACATTAGCCATGGAGG 
CGTGTTTGAGGATGGGAGTCTAGAATGCAAGCACGTCTTTCCCAGCATACATGACGCAGT 
CCTCTTTGCCCAGGCAAATGCTAGAGACGTGACCCCAGGACACAACTTCCAAGGGGTAAG 
_ GTTCTTGCACCTGGGGAATCCTAGGCTCCAAGGCACTGAAATAGCAGGACCAAGAGGCAT 
y TATTAGAAAGAACACAGGAGAAGGTTTAAGTTCCAATATCAAGTCTGCCATTTCAGTTTT 

%J 19035 GGACCTCAGGCCTCTCCCATCCCTGCTCTGGTCTCACTATCTTCCCCACCCCCAGCTCCA 

ATCCACAATGGCTGTTATCTTTCTGAAGGTGATCTTTTCTCCTTCTAGCCCAGGTGTACA 
IT ATGACATTAGCCATGGAGGCGTCTTTGAGGATGGGAGTCTAGAATGCAAGCACGTCTTTC 
^ CCAGCATACATGACGCAGTCCTCTTTGCCCAGGCAAATGCTAGAGACGTGACCCCAGGAC 
U1 ACAACTTCCAAGGGGTAAGGTTCTTGCACCTGGGGAATCCTAGGCTCCAAGGCACTGAAA 
H [T,C] 

AGCAGGACCAAGAGGCATTATTAGAAAGAACACAGGAGAAGGTTTAAGTTCCAATATCAA 
~ GTCTGCCATTTCAGTTTTCTGAATCTGTTTCCTTATCTATAGAATGAGCACCATCAACTA 
5 ACATTACCTACCTCTCTGCATTTTTCTTTTATTTTGTTTTAGGGTTAAATGATAATTACA 
M TCTTTTGTGTCACTTGAAAGCACTTTGTGTATTGTAAAAATTCTTTATCAATATAAGTTT 
flj TCTGGTTGCACAAACACCCAAAGCATAGTAGAGCAGGCCCACTCTGCTGGCATCGTTCCC 

19182 AGGATGGGAGTCTAGAATGCAAGCACGTCTTTCCCAGCATACATGACGCAGTCCTCTTTG 
CCCAGGCAAATGCTAGAGACGTGACCCCAGGACACAACTTCCAAGGGGTAAGGTTCTTGC 
Uj ACCTGGGGAATCCTAGGCTCCAAGGCACTGAAATAGCAGGACCAAGAGGCATTATTAGAA 
Q AGAACACAGGAGAAGGTTTAAGTTCCAATATCAAGTCTGCCATTTCAGTTTTCTGAATCT 
GTTTCCTTATCTATAGAATGAGCACCATCAACTAACATTACCTACCTCTCTGCATTTTTC 
[T,C] 

TTTATTTTGTTTTAGGGTTAAATGATAATTACATCTTTTGTGTCACTTGAAAGCACTTTG 
TGTATTGTAAAAATTCTTTATCAATATAAGTTTTCTGGTTGCACAAACACCCAAAGCATA 
GTAGAGCAGGCCCACTCTGCTGGCATCGTTCCCTGCCTCCTCCTCATCTCTTTCTAAAGG 
GGGCTTTCGGGAAGGGAGGGGAGGGGAGTAAGCCTACCCATTTTAACTTACCGGAGCTTA 
GAGATTTCAGGCTGGTGAGGGATAAAGAGATTGGGTCTGAGTTTTGTCTCAGCTTTTTGA 

19508 TAATTACATCTTTTGTGTCACTTGAAAGCACTTTGTGTATTGTAAAAATTCTTTATCAAT 
ATAAGTTTTCTGGTTGCACAAACACCCAAAGCATAGTAGAGCAGGCCCACTCTGCTGGCA 
TCGTTCCCTGCCTCCTCCTCATCTCTTTCTAAAGGGGGCTTTCGGGAAGGGAGGGGAGGG 
GAGTAAGCCTACCCATTTTAACTTACCGGAGCTTAGAGATTTCAGGCTGGTGAGGGATAA 
AGAGATTGGGTCTGAGTTTTGTCTCAGCTTTTTGACATTTAATTTACTAGCTCAGTAAGT 
[-,G,C] 

ATACAAATGGGATACAAATAACACCATCTAAAACTCCAGAAGACTGGGGAGTCAGAAAAA 
TCCTACCTCCTTGGGGTCCCTGCCCAGATCCCCAGTCATCTCTAGCCCTCAGGGTCCCCT 
CCCAGCTCAGCTCCTGCCCTTGGCCTCCCAAGACTCTTGTTGTGCCCCAGCCCTGGGTAA 
AAACCTCCCCTGCCCTCTGTGGGTCATAAGAAAGGCTTTTCTGGCCCTAGAGCAATGATT 
TGCTCTTTGCCTTAAGAGACTGATGAAGGTGAAACCATCTGTTCTAAGTGCTGAAAGACT 
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AGTTTTCTGGTTGCACAAACACCCAAAGCATAGTAGAGCAGGCCCACTCTGCTGGCATCG 
TTCCCTGCCTCCTCCTCATCTCTTTCTAAAGGGGGCTTTCGGGAAGGGAGGGGAGGGGAG 
TAAGCCTACCCATTTTAACTTACCGGAGCTTAGAGATTTCAGGCTGGTGAGGGATAAAGA 
GATTGGGTCTGAGTTTTGTCTCAGCTTTTTGACATTTAATTTACTAGCTCAGTAAGTCAT 
ACAAATGGGATACAAATAACACCATCTAAAACTCCAGAAGACTGGGGAGTCAGAAAAATC 
[T,G,C] 

TACCTCCTTGGGGTCCCTGCCCAGATCCCCAGTCATCTCTAGCCCTCAGGGTCCCCTCCC 
AGCTCAGCTCCTGCCCTTGGCCTCCCAAGACTCTTGTTGTGCCCCAGCCCTGGGTAAAAA 
CCTCCCCTGCCCTCTGTGGGTCATAAGAAAGGCTTTTCTGGCCCTAGAGCAATGATTTGC 
TCTTTGCCTTAAGAGACTGATGAAGGTGAAACCATCTGTTCTAAGTGCTGAAAGACTGCC 
CAGGAACACACAGGGCGCTGGCTCCTGCCCTCCATGCCTAGAGGGAAACCCTGGGGAAAC 

GCCTAGAGGGAAACCCTGGGGAAACAACGGGCTTTCCTGCTTCGTGAAATTTGTCCGCAG 
AGCAAAGAGGGAGATTCTGGAGGAAGCTGCATTAGTTGTTAGTGCCCTAATCATGTTCAG 
CTACTCTAGTTGGTATGTATACTTGATTAGTCATAGCACTTATAAATAATTTATATTTTA 
TATAATATATACTTACATATTATAGACCATTCACAGATACAAATCACACACATAAACACA 
CACCTTTTCAACAGCATTGTGAGGGACAAAGCAGGCAAAGTGAGGCTGGTTATCAGACTT 
[T,G] 

AACAGATTAGAAAATATATTCCCAGGAGGACAGGAATTCCCCAAGGTCAGGCAGCTAGCC 
AATAGTTTTTCTAAGCTGAGTAAAACCTTCCCTGCCTCTAACGGCCCACAAAGGAGGGAA 
GACCGCGATACACACCTGTGTGGTATAAGGGGGAAGACCACAGCCGTGCTGTTTTTGTGA 
GGCAGGTAAGGGAAGGGGCAAGAGGATAAGTCATGTGTCAGGAAGCAGCGTCCAACCAGA 
GCCGGCCACCTGTCCCTTTTCCTGCCACCATGCACCAACTTTGCTGTTCAGTCACTGAAG 

TTCCTGCTTCGTGAAATTTGTCCGCAGAGCAAAGAGGGAGATTCTGGAGGAAGCTGCATT 
AGTTGTTAGTGCCCTAATCATGTTCAGCTACTCTAGTTGGTATGTATACTTGATTAGTCA 
TAGCACTTATAAATAATTTATATTTTATATAATATATACTTACATATTATAGACCATTCA 
CAGATACAAATCACACACATAAACACACACCTTTTCAACAGCATTGTGAGGGACAAAGCA 
GGCAAAGTGAGGCTGGTTATCAGACTTTAACAGATTAGAAAATATATTCCCAGGAGGACA 
[G, A] 

GAATTCCCCAAGGTCAGGCAGCTAGCCAATAGTTTTTCTAAGCTGAGTAAAACCTTCCCT 
GCCTCTAACGGCCCACAAAGGAGGGAAGACCGCGATACACACCTGTCTGGTATAAGGGGG 
AAGACCACAGCCGTGCTGTTTTTGTGAGGCAGGTAAGGGAAGGGGCAAGAGGATAAGTCA 
TGTGTCAGGAAGCAGCGTCCAACCAGAGCCGGCCACCTGTCCCTTTTCCTGCCACCATGC 
ACCAACTTTGCTGTTCAGTCACTGAAGCTCATTCTGCACTGGCTTCCTCCCTTCCAGGCT 

TGTCTGGTATAAGGGGGAAGACCACAGCCGTGCTGTTTTTGTGAGGCAGGTAAGGGAAGG 
GGCAAGAGGATAAGTCATGTGTCAGGAAGCAGCGTCCAACCAGAGCCGGCCACCTGTCCC 
TTTTCCTGCCACCATGCACCAACTTTGCTGTTCAGTCACTGAAGCTCATTCTGCACTGGC 
TTCCTCCCTTCCAGGCTCCAGGGGATGCTGAGCTCTCCTTGTACGACTCAGAGGAGGACA 
TTCGCAGCTACTGGGACTTAGAGCAGGTGAGCTGAGGGAAGGGGCTGTGAGGGTGGGAGC 
[A,T] 

GGGCGAAGAGGGGAAGGATGGGGTCGCTGTCAAATACAAGGCGTTCACTCAGCTGTCTCA 
CCTCCAGCCCAGAGCAGTCACATTCAAGGCCACAAAGATTTGTGGTCATCTTTGTTTTTT 
TTCTTTTCCTTTTCTTTTTTTTTTTTTTTTAATTTGAGACAAAGTCTCACTCTATCACCC 
AGACTGGAATGCAGTGGCATGATCTCAGCTCACTGCAACCTCTGCCTCCCGGGTTCCAGA 
GGTTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTTCAGGCCTGCGCCCAGCTAATTTT 

ATGCACCAACTTTGCTGTTCAGTCACTGAAGCTCATTCTGCACTGGCTTCCTCCCTTCCA 
GGCTCCAGGGGATGCTGAGCTCTCCTTGTACGACTCAGAGGAGGACATTCGCAGCTACTG 
GGACTTAGAGCAGGTGAGCTGAGGGAAGGGGCTGTGAGGGTGGGAGCAGGGCGAAGAGGG 
GAAGGATGGGGTCGCTGTCAAATACAAGGCGTTCACTCAGCTGTCTCACCTCCAGCCCAG 
AGCAGTCACATTCAAGGCCACAAAGATTTGTGGTCATCTTTGTTTTTTTTCTTTTCCTTT 
[T,C] 

CTTTTTTTTTTTTTTTTAATTTGAGACAAAGTCTCACTCTATCACCCAGACTGGAATGCA 
GTGGCATGATCTCAGCTCACTGCAACCTCTGCCTCCCGGGTTCCAGAGGTTCTCCTGCCT 
CAGCCTCCCGAGTAGCTGGGACTTCAGGCCTGCGCCCAGCTAATTTTTGTATTTTTAGTA 
GAGACAGCTTTTCACCATGTTGGCTGGGCTGGTCTCGAACTTCCGATCTCAAGCAATCTG 
CCTGCCTCGGTCTCCTAAGTGCCTGGATTACAGGCATAAGCCACGATGCCTGGCCTTTGT 

GGGGAAGGATGGGGTCGCTGTCAAATACAAGGCGTTCACTCAGCTGTCTCACCTCCAGCC 
CAGAGCAGTCACATTCAAGGCCACAAAGATTTGTGGTCATCTTTGTTTTTTTTCTTTTCC 
TTTTCTTTTTTTTTTTTTTTTAATTTGAGACAAAGTCTCACTCTATCACCCAGACTGGAA 
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TGCAGTGGCATGATCTCAGCTCACTGCAACCTCTGCCTCCCGGGTTCCAGAGGTTCTCCT 
GCCTCAGCCTCCCGAGTAGCTGGGACTTCAGGCCTGCGCCCAGCTAATTTTTGTATTTTT 
[A, G] 

GTAGAGACAGCTTTTCACCATGTTGGCTGGGCTGGTCTCGAACTTCCGATCTCAAGCAAT 
CTGCCTGCCTCGGTCTCCTAAGTGCCTGGATTACAGGCATAAGCCACGATGCCTGGCCTT 
TGTTTTCATTCTTCTCACTCCCTGAAAGGCATCGTGGGGAGAGGGTGAGTCACTGGACCA 
AGTCCTAGAGAACCAGTATCTATTCTTATTCTCCAACACATCACCCACGTGACCCTGAGC 
AAGCCACATACACCCTGGGCCCTAGTTTTTATCATCTGTGAAATTAGGGGAAACATAGGT 

21787 GGGTGCAATGGTTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGCGGACCAC 
CTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGCGAAACCCCGTCTCTACTAAAA 
GCACAAAAATTAGCCAGGCGTAGTGGTGCATGCCTGTAGTCCCAGCTACTCGGAAGTCTG 
AGGCATGAGAATCACTTGAACCTGGGAGGCAGATGTTGCAGTGAGCCGAGATCGTGCCAC 
TGCACTCCAGCTTGGGTGACAGAGCTAGACTGTCTCAAAAACAAACAAACAAAACAAAAC 
E-,A,Cj 

TAAAAGATATGTGGATATGAGGGATCACCATCCCCATAGGGCCCCTGGATTAACACCACC 
CCACCAATGCCCTGAATTAAAAGAAACCAGATGACTAGGTTTGGAGAAATCTGGCTTTGG 
GTCTATGAGAAGTAGTGTCTCTCTTTGTGCCTCTTCCCATTCTTTTTGACATTGAGCTCC 
ATGGTGCTCTGAATCCGTCTCTCACAGTGCTGATGGCAGGTGGGACAGATTAGAAAATAG 
AGCTGGAGCCACAGAGATTTGGCAGACTGATTTCGGTGCCCTCTTGGAATCTCCAGCACA 

22264 CTCCATGGTGCTCTGAATCCGTCTCTCACAGTGCTGATGGCAGGTGGGACAGATTAGAAA 
ATAGAGCTGGAGCCACAGAGATTTGGCAGACTGATTTCGGTGCCCTCTTGGAATCTCCAG 
CACATTCCAAAAAGCCTGGATAGGACCAAAATAGCTTATCAACGTGAGAAAGGACTTCAG 
AGCTTGTCTACTGCCAACCCTCATTTTACCCAATGAGGAAAGTGAAGCTATTAGGGGGCG 
AGGGACACGTGGAAGGTCACACAGCACACAGGAGGTGATTCACATGTAGATTTCAGCACC 
[T,C] 

GCTCCTGCCACGCTGGACTGGTTCACCTCCTAGGCTGACCCTGCCTCTCCCCTGTTCACA 
CACACTCTCGCACACACACACACACACACACACACACACAGGTGCTTTGTTCTGGCCAGG 
GGTTCCTAGGGTCACCTCTTGGTTGCAGCCACTGTGACCCCAACTGGTCTAACCTCTCTC 
TTCCCCTCCCACTTCCTTCCTGTGGTTCCTGCAGGAGATGTTCGGGAGCATGTTTCACGC 
AGAGACCCTGACCGCCCTGTGAGGGCTCAGCCAGTCCTCATGCTGCCTACAGAGTGCCTG 

22338 ACAGAGATTTGGCAGACTGATTTCGGTGCCCTCTTGGAATCTCCAGCACATTCCAAAAAG 
CCTGGATAGGACCAAAATAGCTTATCAACGTGAGAAAGGACTTCAGAGCTTGTCTACTGC 
CAACCCTCATTTTACCCAATGAGGAAAGTGAAGCTATTAGGGGGCGAGGGACACGTGGAA 
GGTCACACAGCACACAGGAGGTGATTCACATGTAGATTTCAGCACCTGCTCCTGCCACGC 
TGGACTGGTTCACCTCCTAGGCTGACCCTGCCTCTCCCCTGTTCACACACACTCTCGCAC 
E-,C,A] 

CACACACACACACACACACACACACAGGTGCTTTGTTCTGGCCAGGGGTTCCTAGGGTCA 
CCTCTTGGTTGCAGCCACTGTGACCCCAACTGGTCTAACCTCTCTCTTCCCCTCCCACTT 
CCTTCCTGTGGTTCCTGCAGGAGATGTTCGGGAGCATGTTTCACGCAGAGACCCTGACCG 
CCCTGTGAGGGCTCAGCCAGTCCTCATGCTGCCTACAGAGTGCCTGGCACTTGGGACTTC 
CATAAAGGATGAGCCTGGGGTCACAGGGGGTGTCGGGCGGAGGAAAGTGCATCCCCCAGA 

23363 CAGGGACCATGTGCTCTCCACACCCAGGAGTCTAGGCCTTGGTAACTATGCGCCCCCCGT 
CCATCATCCCCAAGGCTGCCCAAACCACCACTGCTGTCAGCAAGCACATCAGACTCTAGC 
CTGGACAGTGGCCAGGACCGTCGAGACCACCAGAGCTACCTCCCCGGGGACAGCCCACTA 
AGGTTCTGCCTCAGCCTCCTGAAACATCACTGCCCTCAGAGGCTGCTCCCTTCCCCTGGA 
GGCTGGCTAGAAACCCCAAAGAGGGGGATGGGTAGCTGGCAGAATCATCTGGCATCCTAG 
[T,C] 

AATAGATACCAGTTATTCTGCACAAAACTTTTGGGAATTCCTCTTTGCACCCAGAGACTC 
AGAGGGGAAGAGGGTGCTAGTACCAACACAGGGAAAACGGATGGGACCTGGGCCCAGACA 
GTCCCCCTTGACCCCAGGGCCCATCAGGGAAATGCCTCCCTTTGGTAAATCTGCCTTATC 
CTTCTTTACCTGGCAAAGAGCCAATCATGTTAACTCTTCCTTATCAGCCTGTGGCCCAGA 
GACACAATGGGGTCCTTCTGTAGGCAAAGGTGGAAGTCCTCCAGGGATCCGCTACATCCC 

23688 AAACTTTTGGGAATTCCTCTTTGCACCCAGAGACTCAGAGGGGAAGAGGGTGCTAGTACC 
AACACAGGGAAAACGGATGGGACCTGGGCCCAGACAGTCCCCCTTGACCCCAGGGCCCAT 
CAGGGAAATGCCTCCCTTTGGTAAATCTGCCTTATCCTTCTTTACCTGGCAAAGAGCCAA 
TCATGTTAACTCTTCCTTATCAGCCTGTGGCCCAGAGACACAATGGGGTCCTTCTGTAGG 
CAAAGGTGGAAGTCCTCCAGGGATCCGCTACATCCCCTAACTGCATGCAGATGTGGAAAG 
EG, A] 
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GGCTGATCCAGATTGGGTCTTCCTGCACAGGAAGACTCTTTAACACCCTTAGGACCTCAG 
GCCATCTTCTCCTATGAAGATGAAAATAGGGGTTAAGTTTTCCATATGTACAAGGAGGTA 
TTGAGAGGAACCCTACTGTTGACTTGAAAATAAATAGGTTCCATGTGTAAGTGTTTTGTA 
AAATTTCAGTGGAAATGCACAGAAAATCTTCTGGCCTCTCATCACTGCTTTTCTCAAGCT 
TCTTCAGCTTAACAACCCCTTCCCTAACAGGTTGGGCTGGCCCAGCCTAGGAAAACATCC 

24 210 TCACTGCTTTTCTCAAGCTTCTTCAGCTTAACAACCCCTTCCCTAACAGGTTGGGCTGGC 
CCAGCCTAGGAAAACATCCCCATTTCTAACTTCAGCCAGACCTGCGTTGTGTGTCTGTGT 
GTTGAGTGAGCTGGTCAGCTAACAAGTCTTCTTAGAGTTAAAGGAGGGGGTGCTGGCCAA 
GAGCCAACACATTCTTGGCCCAGGAGCATTGCTTTTCTGTGAATTCATTATGCCATCTGG 
CTGCCAATGGAACTCAAAACTTGGAAGGCGAAGGACAATGTTATCTGGGATTCACCGTGC 
[A,C] 

CAGCACCCGAAGTGCCAAATTCCAGGAGGACAAGAGCCTTAGCCAATGACAACTCACTCT 
CCCCTACTCCACCTCCTTCCAAGTCCAGCTCAGGCCCAGGAGGTGGGAGAAGGTCACAGA 
GCCTCAGGAATTTCCAAGTCAGAGTCCCCTTTGAACCAAGTATCTAGATCCCCTGAGGAC 
TTGATGAAGTGATCCTTAACCCCCAAGTAATCATTAACCCCCAGACCAGCCTCAGAACTG 
AAGGAGATTGTTGACCCAGTGACCTGGAGTTGAGGCTCAGGGAGAGATCTGCCACATGTC 
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